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Remarks 



Upon entry of the foregoing amendment, elaims 24-39, 43- 1 00, 1 05-1 39 and 141-156 
will be pending in the application, with elaims 24, 5 1, 75, 84, 105, 107, 109, 1 1 1, 121, 128, 
137 and 1 49 being the independent claims. Claims 24, 51, 105, 107, 109, 1 1 1, 128, 137 and 
1 54 have been amended taking the Examiner's comments into consideration. New claim 1 56 
was added. This amendment introduces no new matter and entry thereof is respectfully 
requested. 

Support for the amended claims can be found throughout the specification. Support 
for amended claims 24, 51,1 05, 1 07. 1 09 and 1 1 1 can be found, inter alia, at page 9, lines 
27-28; at page 25, lines 30-3 1 ; at page 64, lines 27-29; at page 65. lines 2-8; at page 87, lines 
12-16; and at page 91 . lines 8-9. Support for claim 1 28 can be found, inter alia, at page 23, 
lines 9-13. Support for claim 137 can be found, inter alia, at page 25, lines 28-29; and at 
page 26, lines 14-16. Support for claim 154 can be found, inter alia, at page 27, line 27, to 
page 28, line 4. Support for new claim 1 56 can be found, inter alia, at page 1 8, line 30, to 
page 19, line 9. 

Based on the above amendments and the following remarks. Applicants respectfully 
request that the Examiner reconsider all outstanding objections and rejections and that they 
be withdrawn. 

Objection to the Specification 

The Examiner objected to the disclosure stating that M [t]he 'Brief Description of the 
Drawings' (pages 2 and 3), specifically for Figures 1 A-1C and 4-6, needs to be amended to 
reflect the new labeling present in the formal drawings filed 2/22/0 1 , including where panels 
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in the original figures are now given their own figure numbers." (Paper No. 29, at page 2.) 

Applicants have amended the specification to comply with the Examiner's request. 
Thus, this objection has been rendered moot. 

Rejections under 35 U.S.C. §112 

Claims 24-26, 28-29, 3 1-32, 34-35, 37-38,43-53, 55-56, 58-59, 61-62, 64-65, 67-74, 
79,81-82, 105, 107, 109, 111, 113-120, 125, 127, 132, 134-135, 144, 146-147 and 149-155 
were rejected under 35 U.S.C. § 1 1 2, first paragraph, for allegedly containing subject matter 
which was not described in the specification in such a way as to reasonably convey to one 
skilled in the art that the inventors, at the time the application was filed, had possession of 
the claimed invention. (Paper No. 29, at page 3.) 

Regarding claims 24, 51, 105, 107 and 111 (and their dependent claims), the 
Examiner alleged that M [t]he specification does not teach using 'variant' polypeptides, such 
as those encoded by the broadly claimed polynucleotides, in order to make antibodies against 
the protein of SEQ ID NO:2." Id. Applicants respectfully disagree. 

In contrast to the Examiner's assertions, the specification does teach using variant 
polypeptides, such as those encoded by the claimed polynucleotides, to make antibodies 
against the protein of SEQ ID NO:2. For example, the specification teaches that even if 
deletion of 

one or more amino acids from the N-terminus or C-terminus 
of a polypeptide results in modification or loss of one or 
more biological functions, other biological activities may still 
be retained. For example, the ability of a deletion variant to 
induce and/or to bind antibodies which recognize the protein 
will likely be retained when less than the majority of the 
residues of the protein are removed from the N-terminus or 
C-terminus. 
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The specification at page 16, lines 4-18. Hence, the specification teaches that a PDEF 
deletion variant can induce and/or bind antibodies. The specification goes on to teach that 
other variants include insertions, inversions, repeats and substitutions. Id. 

In addition, the specification describes phenotypically silent amino acid substitutions 
(see page 1 6, lines 1 5-3 1 ). Furthermore, the genetic code is known. Thus, even though the 
claims do not require the nucleic acid to encode a polypeptide identical to SEQ ID NO:2, 
a polynucleotide with a nucleotide sequence different from that of SEQ ID NO:l can still 
produce a polypeptide identical to that of SEQ ID NO:2 which can be used to induce 
antibodies that bind the protein of SEQ ID NO:2. 

However, solely in an effort to advance prosecution, Applicants have amended the 
claims to recite that the nucleic acid encodes a polypeptide which binds with specificity to 
antibodies having specificity for a polypeptide consisting of amino acids 1 to 335 of SEQ 
ID NO:2. Accordingly, withdrawal of this rejection is respectfully requested. 

Regarding claims 46, 48, 70, 72,79,81, 116, 118, 125, 127, 132, 134, 144, 146, 149 
and claims dependent therefrom, the Examiner alleged that "[t]he terms 'heterologous 
regulatory sequence' and 'regulatory elements' do not appear in the specification as originally 
filed, nor does any generic term equivalent to these terms" and "[i]t is unclear just what these 
terms are intended to convey. There is no clear support for these limitations in the original 
specification." (Paper No. 29, at page 4.) Applicants respectfully traverse this rejection. 

First of all, the term "regulatory regions," which is a generic term equivalent to 
"regulatory sequence," and the term "regulatory elements" appear throughout the 
specification (see specification at page 1, lines 13-18; page 36, lines 18-21; and page 99, 
lines 3-8). In addition, there are many examples of regulatory/control sequences described 
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in the specification, including ribosome binding sites and initiator A TG, which are 
translational control sequences. See also Current Protocols in Molecular Biology, 
Volume 2, pp. 1 6. 1 . 1 - 1 6. 1 .3 ; and 16.12.1-16.12.6 (Attachments A and B). Other examples 
include enhancers, promoters and polyadenylation signal sequences (see page 27, line 27, 
to page 28, line 4, of the specification), which are transcription regulatory elements (see also 
Current Protocols in Molecular Biology, Volume 2, pp. 1 6. 1 3. 1-1 6.1 3. 7)( Attachment C). 
Additionally, the expression vectors disclosed as being part of the invention contain other 
regulatory sequences (see, e.g., Pharmacia BioTech, BioDirectory, Section 5: cDNA, 
Cloning & Vectors, pp. 91-93, 1997)(Attachment D). 

This large number of disclosed regulatory elements constitutes an adequate written 
description for transcriptional, translational and heterologous regulatory sequences and 
would be recognized as such by one of ordinary skill in the art. The specification 
contemplates use of any known sequences; thus, the invention is not limited to the particular 
elements disclosed. Thus, there is clear support for these terms in the specification. One 
skilled in the art reading the specification would immediately understand what the terms 
"heterologous regulatory sequence" and "regulatory elements" are intended to convey. 

The specification also teaches, at page 54, lines 15-23, that the present invention 
includes an expression vector comprising phage operator and promoter elements operatively 
linked to a PDEF polynucleotide. The vector further contains an £. coli origin of replication, 
a T phage promoter sequence, two lac operator sequences, a Shine-Delgarno sequence and 
the lactose operon repressor gene. All of these sequences are examples of other types of 
regulatory' sequences or elements. See also page 52, lines 21 to 30; page 56, lines 20 to 3 1 ; 
page 57, lines 1 to 15; page 59. lines 8-20; page 60. lines 7-14; page 93, lines 26-28; page 
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94, lines 17-18; and page 99, lines 17-19 and 24-26. Therefore, the specification conveys 
with reasonable clarity that Applicants possessed the claimed invention at the time the 
application was filed. 

Furthermore, the absence of definitions or details for well-established terms or 
procedures should not be the basis of a rejection under 35 U.S.C. § 1 12, first paragraph, for 
lack of adequate written description. See 66 Fed. Reg. 1099, 1 105 (Jan. 5, 2001 )(inverse 
correlation between level of skill and knowledge and specificity of disclosure needed to 
satisfy written description; written description need only describe in detail what is new or 
not conventional.); and Spectra-Physics, Inc. v. Coherent, Inc., 827 F.2d 1524, 1534 (Fed. 
Cir. 1987)("[a] patent need not teach, and preferably omits, what is well known in the art.") 
Accordingly, withdrawal of this rejection is respectfully requested. 

The Examiner further alleged that claim 154 "does not limit the identity of the 
polypeptide to that encoded by the recombinant polypeptide, nor even that the cell used be 
able to transcribe or translate the recombinant polynucleotide" and "[c]laim 154 should be 
amended to provide some nexus between the 'polypeptide' recited in claim 1 54 and the '60 
contiguous amino acids' recited in claim 149." (Paper No. 29, at page 5.) 

Applicants respectfully disagree. However, in an effort to advance prosecution. 
Applicants have amended claim 1 54 as the Examiner suggested. Thus, withdrawal of this 
rejection is respectfully requested. Applicants point out that the amendment does not narrow 
the claim in any way since Applicants intended the claim to encompass the polynucleotide 
recited in claim 149. 

The Examiner maintained the rejection of claims 137-139 and 141-148 under 35 
U.S.C. § 1 12. first paragraph, as allegedly containing subject matter w hich was not described 
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in the specification in such a way as to reasonably convey to one skilled in the relevant art 

that the inventors, at the time the application was filed, had possession of the claimed 

invention. (Paper No. 29, at page 5.) According to the Examiner, 

there is no apparent support for the generic embodiment 
instantly claimed with respect to "nucleotide sequence 
heterologous to SEQ ID NO:l" in the context of a fusion to 
nucleic acid encoding an epitope of PDEF. There is no 
evidence of record that such a generic embodiment was 
contemplated by the inventors at the time the invention was 
made. 

Applicant's arguments filed 2/7/01 have been fully 
considered but they are not persuasive. The amendment does 
not obviate the prior grounds of rejection since the claim still 
embraces a genus for which there is no description in the 
original specification, embracing embodiments where the 
"nucleotide sequence heterologous to SEQ ID NO:l n does 
not encode a "second protein" (see specification page 25, 
lines 28-29). The phrase "fused in frame" has no meaning in 
the context of the claim as written because there is no 
indication of any element contained in the "nucleotide 
sequence heterologous to SEQ ID NO: 1 " that has a "frame." 

Id. at page 6. Applicants respectfully disagree. 

The specification, inter alia, at page 26, lines 14-16, teaches that PDEF polypeptides, 
including fragments, and specifically epitopes, can be combined with parts of the constant 
domain of immunoglobulins (IgG), e.g., a nucleotide sequence heterologous to SEQ ID 
NO:l, to form chimeric polypeptides (emphasis added). 

Furthermore, the specification at page 46, lines 10-13, teaches that the PDEF 
promoter may be fused to a coding sequence to obtain prostate specific expression of the 
coding sequence (emphasis added). One skilled in the art reading the above disclosure 
would thus understand how to fuse an epitope in frame to a nucleotide sequence 
heterologous to SEQ ID NO: 1 . as recited in claim 137. Therefore, the meaning of the phrase 



'fused in frame" is clear in the context of the claim as written. However, in an effort to 
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advance prosecution. Applicants have amended claim 1 37 to indicate that the heterologous 
polynucleotide encodes a heterologous polypeptide. Thus, the nucleotide sequence 
heterologous to SEQ ID NO: 1 encodes "a second protein." Accordingly, withdrawal of this 
rejection is respectfully requested. Applicants note that this amendment does not narrow the 
claims in any way since Applicants intended the claim to encompass a fusion protein. 

The Examiner maintained the rejection of claims 24-26, 28, 29, 31, 32, 34, 35, 37, 
38, 40, 41, 43-53, 55, 56, 58, 59, 61, 62, 64, 65, 67-74, 105, 107, 109, 1 1 1 and 113-120 
under 35 U.S.C. § 1 12, first paragraph, because allegedly "the specification, while being 
enabling for a 'nucleic acid' that encodes SEQ ID NO:2 or a fragment of SEQ ID NO:2 (as 
recited in the claims), does not reasonably provide enablement for polynucleotides that do 
not encode SEQ ID NO:2 or a recited fragment of SEQ ID NO:2." (Paper No. 29, at page 6.) 
According to the Examiner, M [t]he specification does not teach using Variant' polypeptides, 
such as those encoded by the broadly claimed polynucleotides, in order to make antibodies 
against the protein of SEQ ID NO:2." Id. at page 7. 

Applicants respectfully submit that the claims are not to methods of making 
antibodies. Instead, the claims are to nucleic acid variants that are 90% identical to nucleic 
acids that encode SEQ ID NO:2 or portions thereof. Methods of making such variants are 
well known and taught in the specification. See the specification, inter alia, at page 1 1, 
line 8, to page 1 7, line 2 1 . The Examiner contends that the claims require raising antibodies 
against specified variants. Although the specification enables such a method, the claims do 
not require it . Whether such variant polypeptides bind to an antibody specific to SEQ ID 
NO:2 can be readily determined using an antibody raised against SEP ID NO:2 , not against 
the variant as the Examiner asserts is required by the claims. 
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Furthermore, it is well established that such variants would be useful to examine the 
protein of SEQ ID NO:2. for example, it is well established in the art of molecular biology 
that such variants can be routinely made and used in epitope-mapping studies. See, e.g.. 
Ikeda ei al, "Epitope mapping of anti-recA protein IgGs by region specified polymerase 
chain reaction mutagenesis,"./. Biol. Chem. 267:6291-6296 (1 992 )( Attachment E). This is 
merely routine experimentation. Thus, the claims are fully enabled by the specification. 

Moreover, as Applicants discussed above, the specification does teach that a 
polynucleotide with a sequence different from that of SEQ ID NO:l can still produce a 
polypeptide identical to that of SEQ ID NO:2 which can be used to induce antibodies that 
bind the protein of SEQ ID NO:2. In addition, the specification at page 16, lines 4-18, 
describes deletion variants which can be used to induce antibodies that bind the polypeptide 
of SEQ ID NO:2. The ability of a deletion variant to induce and/or to bind antibodies which 
recognize the full-length protein will likely be retained when less than the majority of the 
residues of the protein are removed from the N-terminus or C-terminus. The specification 
goes on to teach that other variants include insertions, inversions, repeats and substitutions. 
Thus, in contrast to the Examiner's statement that M [f]or making antibodies that bind to 
PDEF, the specification (pages 24-25) teaches to use PDEF (SEQ ID NO:2) or an antigenic 
fragment of PDEF; no mention is made of using variant polypeptides for this purpose," 
(Paper No. 29, at page 9), the specification clearly teaches that PDEF variants can be used 
for making antibodies. 

In addition, the experimentation required to make and use such polynucleotides 
requires little if any ingenuity and. similar to the situation in In re Wands. 8 U.S.P.Q.2d 1400 
(C.A.F.C. 1988), is merely routine experimentation for one skilled in the art of molecular 




- 15 - EIBERMANN eta/. 

Appl. No. 09/126,945 

biology. Thus, in contrast to the Examiner's assertions, the situation here meets the tests in 
Fields v. Conovet\ 170 U.S.P.Q. 276, 279 (C.C.P.A. 1971). 

Furthermore, the Examiner has provided no evidence that one skilled in the art of 
molecular biology would be unable to make and use nucleic acid variants that are 90% 
identical to nucleic acids that encode SEQ ID NO:2 or portions thereof. Instead, the 
Examiner has merely recited case law to support his assertions. 

However, solely in an effort to advance prosecution, Applicants have amended the 
claims to recite that the nucleic acid encodes a polypeptide which binds with specificity to 
antibodies having specificity for a polypeptide consisting of amino acids 1 to 335 of SEQ 
ID NO:2. Thus, withdrawal of this rejection is respectfully requested. 

The Examiner further rejected claims 24-39, 43-74 and 105-120 under 35 U.S.C. 

§ 1 1 2, second paragraph, as allegedly being indefinite for failing to particularly point out and 

distinctly claim the subject matter which Applicants regard as the invention. (Paper No. 29, 

at page 11.) According to the Examiner, 

[c]laims 24, 51, 105, 107 and 111 (and their dependent 
claims) recite the limitation that the nucleic acid encodes "a 
polypeptide which generates an antibody that binds a 
polypeptide consisting of amino acids 1 to 335 of SEQ ID 
NO:2" (emphasis added). First, it is unclear what the term 
"generates" means in this context. . . . Second, the meets and 
bounds of the claim are unclear with respect to what "binds" 
is intended to mean in this context. 

Id. at pages 11-12. 

Applicants respectfully disagree. However, as indicated above in response to the 35 
U.S.C. § 1 12, first paragraph, rejection. Applicants have amended the claims in an effort to 
advance prosecution. The amended claims do not recite "generates" and are clear with 
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respect to what "binds" means. Accordingly, withdrawal of this rejection is respectfully requested. 

Rejections under 35 U.S.C* § 102 

Claims 128-136 remained rejected under 35 U.S.C. § 102(a) as allegedly being 
anticipated by GenBank Accession No. AA662204. (Paper No. 29, at page 12.) According 
to the Examiner, M [t]he prior art polynucleotide includes codons for a DS dipeptide 
corresponding to SEQ ID NO:2 positions 283-284, respectively, mentioned above, which 
anticipates the claims when m and n are either 237 and 238, respectively, or 241 and 242, 
respectively." Id. 

Applicants have amended claim 128 to recite that m is an integer from 2 to 236 and 
n is an integer from 243 to 335. Thus, the cited dipeptide is not encompassed by the claim. 
Support for this amendment can be found at pages 1 9-23 of the specification. Accordingly, 
this rejection has been rendered moot. 

The Examiner further maintained the rejection of claims 128-136 under 35 U.S.C. 
§ 102(b) as allegedly being anticipated by Chen et al^ Dev. Biol 757:176-191 (1992). 
(Paper No. 29, at page 13.) 

According to the Examiner, M [t]he ets-4 polypeptide has a DS dipeptide at residues 
60-61 , which anticipates the claims when m and n are either 237 and 238, respectively, or 
241 and 242, respectively." Id. 

As indicated above. Applicants have amended claim 1 28 to recite that m is an integer 
from 2 to 236 and n is an integer from 243 to 335. Thus, the cited dipeptide is not 
encompassed by the claim. Accordingly, this rejection has been rendered moot. Applicants 
emphasize that the value "m is an integer from 2 to 236" and "n is an integer from 243 to 
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335" is not critical to the invention. Thus, this amendment should not be construed as a 
surrender of equivalent embodiments wherein m is more than 236 or n is less than 243 that 
do not read on the cited references. 

Double patenting 

The Examiner again advised Applicants that should claims 27, 30, 33. 36 and 39 be 
found allowable, claims 54, 57, 60, 63 and 66 would be objected to under 37 C.F.R. § 1 .75 
as being a substantial duplicate thereof. (Paper No. 29, at page 13.) 

In response, Applicants note that they are entitled to claim the invention using 
multiple claims, so long as the claim set as a w hole clearly defines the subject matter of the 
invention. See M.P.E.P. § 2173.05(n). Further, Applicants note that claims 27, 30, 33, 36 
and 39 and 54, 57, 60, 63 and 66 are not duplicative, because claims 27, 30, 33, 36 and 39 
are drawn to an isolated polynucleotide comprising a nucleic acid at least 90% identical to 
a reference nucleic acid encoding certain amino acids of SEQ ID NO:2, whereas claims 54, 
57, 60, 63 and 66 are drawn to an isolated polynucleotide comprising a nucleic acid 
encoding an amino acid sequence at least 90% identical to a reference amino acid 
sequence encoding certain amino acids of SEQ ID NO:2. Thus, the claims are of different 
scope. Accordingly, withdrawal of this objection is respectfully requested. 

Allowable Subject Matter 

The indication that claims 75-78. 80, 83- 1 00, 1 2 1 - 1 24 and 1 26 are allowed is noted 
and appreciated by Applicants. 
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Conclusion 

All of the stated grounds of objection and rejection have been properly traversed, 
accommodated, or rendered moot. Applicants therefore respectfully request that the 
Examiner reconsider all presently outstanding objections and rejections and that they be 
withdrawn. Applicants believe that a full and complete reply has been made to the 
outstanding Office Action and, as such, the present application is in condition for allowance. 
If the Examiner believes, for any reason, that personal communication will expedite 
prosecution of this application, the Examiner is invited to telephone the undersigned at the 
number provided. 

Prompt and favorable consideration of this Amendment and Reply is respectfully 
requested. 



Respectfully submitted, 

Sterne, Kessler, Goldstein & Fox p.l.l.c. 

Gaby L. Longsworth 
Agent for Applicants 
Registration No. 47,756 

Date: "L, 2 -\ ( /CCl 

1 100 New York Avenue, N.W. 
Suite 600 

Washington, D.C. 20005-3934 
(202) 371-2600 
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Version with markings to show changes made 

//; the specification: 

Fourth full paragraph on page 2. lines 24-26: 

Figures [ 1 A-C] 1 A-B show the nucleotide sequence (SEQ ID NO: 1 ) and the deduced 
amino acid sequence (SEQ ID NO:2) of PDEF. Regions of conservation to the pointed and 
Ets domains are indicated by single and double underline, respectively. 

Second full paragraph on page 3, lines 16-22: 

[Figure 4 shows] Figures 4A-C show the tissue distribution of PDEF expression in 
different human fetal (Figure 4 A) and adult tissues (Figures 4B-C) by Northern 
hybridization. The blots were sequentially probed with PDEF (upper panel), ESE- 1 (middle 
panel) and GADPH cDN A probes (lower panel) under stringent conditions using poly(A)+ 
mRNA from the indicated tissues (See Example 3 of present invention). A skilled artisan 
would readily associate the intensity and location of the bands with respect to the blot as 
indicative of both the abundance and size of the PDEF mRNA within each tissue. 

Third full paragraph on page 3, lines 24-28: 

[Figure 5 shows] Figures 5A-B show the tissue distribution of PDEF expression 
within poly(A)+ mRNA from human fetal and adult tissues by Dot Blot Hybridization. The 
blot was probed with PDEF under the conditions described in Example 3 of present 
invention. A skilled artisan would readily associate the intensity of the dots as indicative of 
both the abundance of the PDEF mRNA within each tissue. 

Fourth full paragraph on page 3, lines 30-33: 

[Figure 6 shows] Figures 6A-D show [in situ] in situ hybridization studies. Paired 
brightfield (A, C) and corresponding polarized fluorescence (B, D) photomicrographs. 
Intense labeling of prostate epithelium in normal lung is seen with antisense probe to PDEF 
mRNA (A, B). No labeling is seen with control sense probe (C D). 

In the claims: 

Please rewrite the claims as follows: 

24. (Twice Amended) An isolated polynucleotide comprising a first nucleic acid 
at least 90% identical to a reference nucleic acid selected from the group consisting of: 

(a) a nucleic acid encoding amino acids 142 to 21 1 of SEQ ID NO:2; 

(b) a nucleic acid encoding amino acids 248 to 33 1 of SEQ) ID N( ):2: 

(c) a nucleic acid encoding amino acids 2 to 335 of SEQ ID NO:2; 

(d) a nucleic acid encoding amino acids 1 to 335 of SI:Q ID N():2; and 
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(e) a nucleic acid encoding the complete amino acid sequence encoded 
by the cDNA clone contained in ATCC Deposit No. 203072; 

wherein said first nucleic acid encodes a polypeptide which [generates an antibody that) 
binds with specificity to antibodies having specificity for a polypeptide consisting of amino 
acids 1 to 335 of SEQ ID NO:2. 

51. (Twice amended) An isolated polynucleotide comprising a nucleic acid 
encoding a first amino acid sequence at least 90% identical to a reference amino acid 
sequence selected from the group consisting of: 

(a ) amino acids 1 42 to 2 1 1 of SEQ ID NO:2; 

(b) amino acids 248 to 33 1 of SEQ ID NO:2; 

(c ) amino acids 2 to 335 of SEQ ID NO:2; 

(d) amino acids 1 to 335 of SEQ ID NO:2; and 

(e) the complete amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 203072; 

wherein said nucleic acid encodes a polypeptide which [generates an antibody that] binds 
with specificity to antibodies having specificity for a polypeptide consisting of amino acids 
1 to 335 of SEQ IDNO:2. 

1 05 . (Three times amended ) An isolated polynucleotide comprising a nucleic acid 
at least 95° o identical to a nucleic acid encoding at least 70 contiguous amino acids of SEQ 
IDNO:2; 

wherein said nucleic acid encodes a polypeptide which [generates an antibody that] binds 
with specificity to antibodies having specificity for a polypeptide consisting of amino acids 
1 to 335 of SEQ ID NO:2. 

1 07. ( Three times amended ) An isolated polynucleotide comprising a nucleic acid 
at least 95° o identical to a nucleic acid encoding at least 80 contiguous amino acids of SEQ 
IDNO:2; 

wherein said nucleic acid encodes a polypeptide which [generates an antibody that] binds 
with specificity to antibodies having specificity for a polypeptide consisting of amino acids 
1 to 335 of SEQ ID NO:2. 

109. (Twice amended) An isolated polynucleotide comprising a nucleic acid at 
least 95° o identical to a nucleic acid encoding at least 100 contiguous amino acids of SEQ 
IDNO:2; 

wherein said nucleic acid encodes a polypeptide which [generates an antibody that] binds 
with specificity to antibodies having specificity for a polypeptide consisting of amino acids 
1 to 335 of SEQ ID NO:2. 

111. (Twice amended) An isolated polynucleotide comprising a nucleic acid at 
least 95° o identical to a nucleic acid encoding at least 150 contiguous amino acids of SEQ 
IDNO:2; 

wherein said nucleic acid encodes a polypeptide which [generates an antibody that] binds 
with specificity to antibodies having specificity for a polypeptide consisting of amino acids 
1 to 335 of SEQ IDNO:2. 
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1 28. (Three times amended) An isolated polynucleotide comprising a nucleic acid 
encoding an amino acid sequence from position m to position n of SEQ ID NO:2, wherein 
m is an integer from 2 to [276] 236, n is an integer from [141] 243 to 335; and m is less 
than n. 

137. (Three times amended) A polynucleotide comprising a nucleic acid fused in 
frame to a nucleotide sequence heterologous to SEQ ID NO:l„ wherein said heterologous 
nucleotide sequence encodes a heterologous polypeptide, and wherein said nucleic acid is 
selected from the group consisting of: 

(a) a nucleic acid encoding amino acids 279 to 287 of SEQ ID NO:2; 

(b) a nucleic acid encoding amino acids 292 to 300 of SEQ ID NO:2; and 

(c) a nucleic acid encoding amino acids 317 to 325 of SEQ ID NO:2. 

1 54. (Once Amended) A method of producing [a] the polypeptide encoded by the 
polynucleotide of claim 149, comprising culturing [the] a host cell [of claim 153] 
comprising said polynucleotide under conditions such that [a] said polypeptide is expressed, 
and recovering said polypeptide. 



New claim 1 56 was added. 



Attacfflent A 



EXPRESSION OF PROTEINS IN 
ESCHERICHIA COLI 

Overview of Protein Expression in E. coli 



The study of Escherichia coli during the 
1960s and 1970s made it the best understood 
organism in nature (Chapter 1 ). Today's recom- 
binant DNA technology is a direct extension of 
the genetic and biochemical analyses carried 
out at that time. Even before the advent of 
molecular cloning, genetically altered E. coli 
strains were used to produce quantities of pro- 
teins of scientific interest. When cloning tech- 
niques became available, most cloning vectors 
utilized E. coli as their host organism. Thus, it 
is not surprising that the first attempts to ex- 
press large quantities of proteins encoded by 
cloned genes were carried out in E. coli 

E. coli has two characteristics that make it 
ideally suited as an expression system for many 
kinds of proteins: it is easy to manipulate and 
it grows quickly in inexpensive media. These 
characteristics, coupled with more than 10 
years' experience with expression of foreign 
genes, have established E. coli as the leading 
host organism for most scientific applications 
of protein expression. 

Despite a growing literature describing suc- 
cessful protein expression from cloned genes, 
each new gene still presents its own unique 
expression problems. No one, and certainly no 
laboratory manual, can provide a set of meth- 
ods that will guarantee successful production 
of every protein in a useful form. Nevertheless, 
the vast body of accumulated knowledge has 
led to a general approach that often helps to 
solve specific expression problems. This unit 
introduces general considerations and strate- 
gies, while subsequent units {16.2-16.7) describe 
procedures that can be applied to specific ex- 
pression problems. 

GENERAL STRATEGY FOR GENE 
EXPRESSION IN E. COLI 

The basic approach used to express all for- 
eign genes in E. coli begins with insertion of 
the gene into an expression vector, usually a 
plasmid. This vector generally contains several 
elements: (1) sequences encoding a selectable 
marker that assure maintenance of the vector 
in the cell; (2) a controllable transcriptional 
promoter (e.g., lac, trp, or tac) which, upon 
induction, can produce large amounts of 



mRNA from the cloned gene; (3) translational 
control sequences, such as an appropriately 
positioned ribosome-binding site and initiator 
ATG; and (4) a polylinker to simplify the in- 
sertion of the gene in the correct orientation 
within the vector. Once constructed, the ex- 
pression vector containing the gene to be ex- 
pressed is introduced into an appropriate E. coli 
strain by transformation (unit in). 

SPECIFIC EXPRESSION 
SCENARIOS 

Although this general approach — insertion 
of the gene of interest into an expression vector 
followed by transformation in E. coli — is com- 
mon to all expression systems, specific proce- 
dures differ greatly. When choosing a pro- 
cedure, it is helpful to consider the final appli- 
cation of the expressed protein, as this often 
dictates which expression strategy to use {unit 
16.4 a). 

Antigen Production 

If the goal is to use the expressed protein as 
an antigen to make antibodies, several ap- 
proaches are available to make protein reliably 
and to allow for rapid purification of the anti- 
gen. The two best approaches are synthesis of 
fusion proteins with specific "tag" sequences 
that can be retrieved by affinity chromatogra- 
phy {UNITS 16.5, 16.6. 16.7 & 16.8; See also UNIT 10.11 B) 

and synthesis of the native protein, or a frag- 
ment of it, under conditions that cause it to 
precipitate into insoluble inclusion bodies 
{units 16.4a & 16.6). These inclusion bodies can 
be purified sufficiently by differential centrif- 
ugation so preparative denaturing polyacryl- 
amide gel electrophoresis {unit 10.2) will yield 
an isolated band that can be cut out and crushed, 
or electroeluted {unit 10 5), to provide antigenic 
material for injection into an animal. 

Biochemical or Cell Biology Studies 

If the goal is to use the expressed protein as 
a reagent in a series of biochemical or cell 
biology experiments, other considerations are 
relevant. In this case, the authenticity of the 
protein's function (e.g., high-specific -activity 
enzyme, binding protein, or growth factor) is 
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very important, while the ease of preparing the 
protein matters less. For this application, it is 
possible to express the protein as a fusion pro- 
tein containing a specific protease-sensitive 
cleavage site so the N-tcrminal peptide tail can 
be removed easily, leaving only the native 
amino acid sequence (units 16.4. 16 6, 16 7 & i6.s). 
Alternatively, direct expression vectors of the 
type described in units I6.2 & 16.3 may be used to 
produce the authentic primary sequence. When 
expressed, the protein may be soluble and ac- 
tive, as is the case with many intracellular 
enzymes. If it is insoluble, as is the case for 
many secreted growth factors when they are 
made cytoplasmically in E. coli, it may be 
necessary to isolate inclusion bodies, solubilize 
the protein using denaturing agents, and refold 
the protein. Refolding is usually not too diffi- 
cult when the protein is of moderate size (Mar- 
ston and Hartley, 1990). Whether the protein is 
expressed in a soluble form or whether it re- 
quires refolding, its integrity can usually be 
checked by specific enzyme assays or by bio- 
assays. 

Structural Studies 

If the goal is to do structural studies of the 
expressed protein, the greatest constraints are 
imposed on the expression system. Because it 
is nearly impossible to show that a protein of 
unknown structure has been precisely refolded 
after denaturing, the protein must generally be 
made in a soluble form so its purification does 
not require a denatu rati on/renatu rati on step. 
Usually, the soluble form of the protein — either 
intracellular or secreted — must be made in 
strains and by induction protocols that mini- 
mize proteolytic degradation. 

Soluble expression of most eukaryotic pro- 
teins is best achieved with systems that allow 
induction of synthesis without changing the 
temperature; for example, by inducing tran- 
scription from the trp (Edman et al., 1981; de 
Boer et al., 1983) or tac (de Boer et al., 1983) 
promoters. Maximum accumulation of soluble 
product is best achieved by testing expression 
in several strains and at several temperatures, 
and picking the combination that works best. 
This is an active area of research at present 
(Schein, 1989); the rules are not yet understood, 
so little more than trial and error can be recom- 
mended. 

TROUBLESHOOTING GENE 
EXPRESSION 

Once an expression strategy has been cho- 
sen and the gene is introduced into an appro- 




priate expression vector, several strains of E. 
coli should be transformed with the vector and 
protein production should be monitored. Ide- 
ally, the protein of interest will be produced in 
an active form and in sufficient amounts to 
allow its isolation. Often, however, the protein 
will be made either in very small amounts or in 
an insoluble form, or both. If this happens, there 
are various approaches that may correct the 
problem. 

// not enough protein is produced: 

1 . Reconstruct the 5'-end of the gene, max- 
imizing its A+T content while preserving the 
protein sequence it encodes. This may reduce 
secondary structure within the mRNA (De- 
Lamarter et al., 1985), or it may alter an as yet 
undefined parameter of the reaction. Regard- 
less of the underlying cause, this procedure 
usually increases translation efficiency. 

2. Determine if a transcriptional terminator 
is present. If the vector does not have a tran- 
scriptional terminator downstream from the 
site at which the gene is inserted, put one in. 
This often aids expression, probably by in- 
creasing mRNA stability and by decreasing 
nucleotide drain on the cell. 

3. Examine the sequence of the cloned gene 
for codons used infrequently in E. coli genes. 
These so-called rare codons are usually not a 
rate-limiting problem, but if four or more hap- 
pen to occur contiguously, they can reduce 
expression significantly (Robinson et al., 
1984), perhaps by causing ribosomes to pause. 
Ribosomal pausing can uncouple transcription 
from translation, leading to premature termina- 
tion of the message. Even if transcription pro- 
ceeds normally, the mRNA 3' to the stalled 
ribosomes can be exposed to degradation by 
host ribonucleases, reducing its stability. Thus, 
if stretches of rare codons are found, they 
should be altered to codons more favorable to 
high expression in E. coli. 

If enough protein is produced, but it is in- 
soluble when the application requires it to 
be active and soluble: 

1. Vary the growth temperature. As men- 
tioned above, many proteins are more soluble 
at lower than at higher temperatures (Schein 
and Noteborn, 1988). On the other hand, some 
enzymes have a higher specific activity when 
made at temperatures >37°C (J. McCoy and P. 
Schendel, unpub. observ.). E. coli cm synthe- 
size proteins at temperatures ranging from 10° 
to 43°C, so trying expression at different tem- 
peratures is often worthwhile. 
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2. Change fermentation conditions. Many 
proteins contain metals as structural and cata- 
lytic colactors. If the protein is being made 
faster than metals can be transported into the 
cell, the apoprotein without its metal cofactor 
will accumulate. This apoprotein will not fold 
correctly and will likely be insoluble. At the 
very least, the average specific activity of the 
expressed protein will be lower than expected. 
Different media and metal supplements can be 
tested and the best combination used. Clearly, 
if there is information about the metal content 
of the protein, these supplements can be de- 
signed more rationally. If no information is 
available, a more random approach must be 
tried. 

3. Alter the rate of expression by using 
low-copy-number plasmids. This can be done 
by using the pACYC family (Chang and 
Cohen, 1978) or using single-copy chromo- 
somal inserts of the cloned gene into a suitable 
target gene (Hamilton etal., 1 989). Such reduc- 
tions in gene dosage often reduce the final yield 
of protein, but the slower kinetics of synthesis 
they afford can sometimes result in production 
of soluble proteins. 

To restate the obvious, protein expression is 
an inexact science at present. However, most 
proteins can be made in E. coli in a form that 
is useful for a variety of functions. The proce- 
dures employed are relatively quick and un- 
complicated, and the rewards for success are 
great. 
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Overview of Protein Expression 
in Mammalian Cells 



The techniques of gene isolation, modifica- 
tion, and transfer into appropriate host cells 
have provided a powerful means to study gene 
expression and to evaluate protein structure and 
function. These techniques make it possible to 
produce large amounts of proteins that pre- 
viously could be isolated in only minute quan- 
tities and allow the generation of proteins with 
specific, designed alterations. 

As described elsewhere in this manual 
( Chapter 9), mammalian cells are often used as 
hosts for expression of genes obtained from 
higher eukaryotes because the signals for syn- 
thesis, processing, and secretion of these pro- 
teins are usually recognized. Mammalian ex- 
pression of foreign genes has been used for 
many purposes including: ( 1 ) confirmation that 
genes isolated by different approaches can di- 
rect the synthesis of a desired protein, (2) evalu- 
ation of the effect of specific mutations intro- 
duced into genes, (3) direct isolation of genes 
based on screening or selecting recipient cells 
for production of a particular protein, (4) pro- 
duction of large amounts of proteins that nor- 
mally are available in only limited quantity, and 
(5) analysis of physiologic consequences of 
expression of specific proteins. 

The units that follow in Sections III and IV 
describe three vector systems or strategies for 
introducing foreign genes into mammalian 
cells (additional transfection methods can be 
found in units 9.1-9.4). The first method relies 
upon COS cells for rapid, transient expression 
of protein from specific vectors (unit it. u). Trie 
second method relies upon Chinese hamster 
ovary (CHO) cells; in this procedure, a targeted 
gene is cotransfected with a selectable marker, 
becomes stably integrated into the host cell 
chromosome, and is subsequently amplified 
{unit I6.14). The third system relies upon vac- 
cinia viral vectors in a transient expression 
system (units J6 15-16 19). The systems described 
in these units differ in the ways in which DNA 
is introduced into the cell, in the particular 
vectors used with each system, and in their 
suitability for particular cell types. 



The criteria for choosing a certain system 
include these considerations: whether DNA 
can be introduced directly by transfection 
methods or needs to be introduced by viral-me- 
diated transfer, the identity of the control ele- 
ments that can direct efficient mRNA expres- 
sion and protein synthesis, and whether a par- 
ticular host cell is appropriate for expression of 
the gene of interest. If it is necessary to produce 
a large amount of protein for a long period of 
time, theCHO system should be utilized. When 
transient expression is appropriate, the choice 
of which system to use depends upon the par- 
ticular experiment. When a high transfection 
efficiency is necessary, the vaccinia system is 
appropriate because every cell can be infected 
with the virus and gene of interest; however, 
this system suffers from the disadvantage that 
the cells die within one to two days. If a lower 
transfection efficiency is sufficient and if it is 
desirable that the cells continue to grow for 
several days, COS cells should be used. 

VIRAL-MEDIATED GENE 
TRANSFER 

Many viruses that infect mammalian cells 
have evolved mechanisms to usurp the protein 
synthesis machinery of the host to produce their 
viral proteins. The ability to engineer the ge- 
netic material of these viruses has made it 
possible to place desired coding regions under 
the control of the viral expression elements and 
to produce infectious virus particles that direct 
high levels of foreign gene expression. Viral- 
mediated gene transfer provides a convenient, 
efficient means to introduce foreign DNA into 
the majority of recipient cells. In addition, for 
many viruses, viral replication yields multiple 
copies of template DNA that can serve to in- 
crease the total amount of transcript made by 
the foreign gene. 

Because some viruses can infect a range of 
cell types derived from different species, 
viral-mediated gene transfer often allows the 
convenient introduction of foreign genes into a 
variety of different cells. Representative 



SECTION III 



UNIT 16.12 



Protein 
Expression 



Contributed by Randal J. Kaufman 

Current Protocols in Molecular Biology ( 1 997) 16.12.1-16.12.6 
Copyright <0 1997 by John Wiley & Sons, Inc. 



16.12.1 

Supplement 38 



Overview of 
Protein 
Expression in 
Mammalian Cells 

16.12.2 

Supple mcnt 38 



Table 16.12.1 Expression Levels and Uses for Different Mammalian Cell Expression Systems' 7 



Ccl 



line 



Expression 
method 



Typical expression 
level (Jig/ml) 



Primary use 



Monkey cells 
CVl 

COS 



CVl 



SV40 virus 
infection 

Transient DNA/ 

DEAE-dextran 

transfection 

Transient DNA/ 

DEAE-dextran 

transfection 



l-lO 



Expression of wild-type and 
mutant proteins 

Cloning by expression in 
mammalian cells; rapid 
characterization of cDNA clones; 
expression of mutant proteins 



0.05 



M u rin efib ro blasts 
CI 27 

3T3 

Other cells 
CHO(DHFR-) 



BPV stable 
transformant 

Retrovirus 
infection 



Primate 



Stable DHFR+ 
transformant 

Amplified MTX r 



Vaccinia virus 
infection 

EBV vector 



1-5 
0.1-0.5 

0.01-0.05 
10 
1 

n.a. 



High-level constitutive protein 
expression 

Gene transfer into animals; 
expression in different cell types 



High-level constitutive protein 
expression 

Production of vaccines; 
expression of toxic proteins 

Cloning by expression 



Abbreviations: BPV, bovine papilloma virus; EBV, Epstein-Barr virus. 



expression levels obtained from SV40 recom- 
binant viruses, retroviruses, and vaccinia vi- 
ruses are shown in Table 16.12.1 in compari- 
son to other expression strategies. A more 
detailed review of the different eukaryotic viral 
vectors can be found in Muzyczka (1989). 

Most viral expression systems have certain 
common limitations. The first is size of the 
inserted sequence. If the sequence is too large, 
it may not be packaged properly into the viral 
genome (maximal insert sizes for SV40 and 
retroviruses are 2.5 kb and 6 kb, respectively) 
and/or may be subjected to rearrangement upon 
propagation of the viral stock. The second limi- 
tation is the cytopathic effect of some viruses 
on the host cell, which limits expression to a 
relatively short period of time. Third, the vari- 
ability in gene expression depends on multiple 
parameters. The reasons for this are not com- 
pletely clear but depend upon proper transla- 
tion, processing, and modification of the result- 
ing protein; thus, there is a large degree of 



variability in the success with any particular 
DNA insert. 

units I6.i5-i6.i9 address one viral vector 
system, vaccinia virus, which has demonstrated 
success. Vaccinia virus is most useful for the 
production of proteins (such as regulatory fac- 
tors) that are potentially toxic to the cell. 

TRANSIENT EXPRESSION 

The efficiency of expression from transient 
transfection depends on the number of cells that 
take up the transfected DNA, the gene copy 
number, and the expression level per gene. 
Most methods of DNA transfer allow 5% to 
50% of the cells in the population to acquire 
DNA and express it transiently over a period of 
several days to several weeks. Eventually, how- 
ever, because cells containing the foreign DNA 
grow more slowly, they are lost from the popu- 
lation. Although this gradual decrease in the 
amount of expression occurs, transient expres- 
sion offers a convenient means to compare 
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expression from different vectors and to verify 
that any given expression plasmid is functional 
before initiating the more laborious procedure 
of isolating and characterizing stably trans- 
fected ceil lines. Because they do not require 
isolation of clones of cells with the vector 
integrated at different sites, transient expres- 
sion experiments are not subject to the differ- 
ences such position effects can have on expres- 
sion levels. In addition, transient expression 
experiments preclude the possibility of select- 
ing cells that harbor mutations either in the 
transfected DNA or elsewhere in the host chro- 
mosome (which may influence results). 

Transient DNA transfection is most fre- 
quently used to: ( 1 ) verify the identity of cloned 
genes based on their ability to express a par- 
ticular activity, (2) rapidly study the effect of 
engineered mutations on either gene activity or 
protein function, and (3) isolate genes from 
cDNA libraries constructed in mammalian ex- 
pression vectors based on their ability to ex- 
press a particular activity in cells. The limita- 
tions of transient expression are that it is diffi- 
cult to scale up for production of large 
quantities of protein (>1 mg), that it is difficult 
to study the consequences of gene expression 
only in the portion of the total population that 
has been transfected, and that the high copy 
number is eventually lethal; this lethality may 
significantly affect results. 

unit 16. 13 describes procedures and vectors 
used for transient expression in COS cells. This 
cell line is most frequently used for transient 
expression and is derived from African green 
monkey kidney cells by tranformation with an 
origin-defective simian virus 40 (SV40). COS 
cells express high levels of the SV40 large 
tumor (T) antigen which is required to initiate 
viral DNA replication at the origin of SV40. T 
antigen-mediated replication can amplify the 
copy number of plasmids containing the SV40 
origin of replication to > 1 00,000 per cell, which 
results in high expression levels from the trans- 
fected DNA. 

STABLE DNA TRANSFECTION 

If a selection procedure is applied after DNA 
transfection, it is possible to isolate cells that 
have stably integrated the foreign DNA into 
their genome (see unit 9.5 for an additional 
discussion). Different cell lines exhibit differ- 
ent frequencies of stable transformation and 
vary in their capacity to incorporate foreign 
DNA. In most cases, the limiting factor for 
obtaining stable transformants is the frequency 
of DNA integration, not the frequency of DNA 



uptake. Cells selected for by incorporation and 
expression of one genetic marker will fre- 
quently incorporate a second gene provided on 
an independent plasmid during transfection; 
this ability to incorporate two separate plasmids 
into the chromosome has been termed ^trans- 
formation. During this process, transfected 
DNA molecules usually become ligated inside 
the cell and subsequently integrate as a unit via 
nonhomologous recombination into host chro- 
mosomal DNA. 

Different cell lines and transfection methods 
yield varying frequencies of cotransformation. 
For example, the frequency of stable cotrans- 
formation in CHO cells is lower than that ob- 
served in mouse L cells, possibly because 
mouse L cells are able to incorporate more 
DNA into their genome than CHO cells. 
Cotransformation using calcium phosphate- 
mediated DNA transfection is very efficient, 
whereas cotransformation by fusion of bacte- 
rial protoplasts (containing two independent 
plasmids) with mammalian cells is very rare. 
When cotransformation is inefficient, it is de- 
sirable to engineer both the selectable marker 
and the gene of interest into the same plasmid,, 
A number of vectors that facilitate this proce- 
dure have been constructed (Kaufman, 1990a), 

Stable transformants arc usually selected by 
their ability to confer resistance to cytotoxic 
drugs. Such resistance can be recessive or 
dominant. Genes conferring dominant drug re- 
sistance can be used independently of the host 
cell line. Frequently, selectable marker genes 
are derived from bacterial genes for which there 
is no mammalian counterpart. For example, the 
Tn5 neomycin phosphotransferase gene encod- 
ing resistance to the antibiotic G418, or the 
Escherichia coli hygromycin phosphotransfer- 
ase gene encoding resistance to hygromycin 
have been engineered to be expressed and se- 
lected in mammalian cells. 

Genes conferring recessive drug resistance 
require a particular host that is deficient in the 
activity being selected. Many recessive genetic 
selectable markers encode enzymes involved in 
the purine and pyrimidine biosynthetic path- 
ways. When de novo biosynthesis for purines 
or pyrimidines is inhibited, cells can utilize 
purine or pyrimidine salvage pathways, provid- 
ing the salvage enzymes (i.e., thymidine kinase, 
hypoxanthine-guanine phosphoribosyltrans- 
ferase, adenine phosphoribosyltransferase, or 
adenosine kinase) necessary for the conversion 
of nucleoside precursors to the corresponding 
nucleotides are present (Fig 16.12.1). These 
salvage enzymes are not required for cell 
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growth when tic novo purine or pymmdme 
biosynthesis is functional; thus, cells deficient 
in a particular salvage pathway are viable under 
normal growth conditions. However, when 
drugs that inhibit de novo biosynthesis (such as 
methotrexate) are added, the cells die. Cells that 
acquire the capability to express the deficient 
activity (i.e., the missing salvage en/.yme) via 
gene tranfer can be selected for under these 
conditions. In a complementary manner, in 
cells that are defective in de novo biosynthetic 
pathways and have functional salvage path- 
ways, it is possible to select for expression of 
the defective gene (e.g., dihydrofolate redu- 
catase or aspartate transcarbamylase) in the de 
novo pathway by removal of nucleosides from 
the growth medium. 




AMPLIFICATION OF 
TRANSFECTED DNA 

Frequently, it is desirable to increase expres- 
sion by selecting for increased copy number of 
the transfected DNA within the host chromo- 
some. The ability to coamplify transfected 
DNA has permitted a 100- to 1000-fold in- 
crease in the expression of the proteins encoded 
by transfected DNA. Although there are over 
twenty selectable and amplifiable genes that 
have been described (Kaufman, 1990a), the 
most experience and success has occurred 
when methotrexate selection has been used for 
amplification of transfected dihydrofolate re- 
ductase genes, unit 16 n describes the use of 
dihydrofolate reductase-deficient CHO cells to 
obtain high-level expression of heterologous 
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Fig. 16.12.1 Purine and pyrimidine biosynthetic pathways. The solid arrows indicate single 
reactions. Dashed arrows indicate multiple reactions. Solid squares indicate reactions inhibited by 
methotrexate. The hatched square indicates the primary reaction inhibited by azaserine. The open 
square indicates the reaction inhibited by mycophenolic acid. Abbreviations used for enzymes 
involved in de novo biosynthetic pathways: DHFR, dihydrofolate reductase; CAD, carbamoyl-phos- 
phate synthetase, aspartate transcarbamoylase, and dihydroorotase; SHMT, serine hydroxymethyl 
transferase; TS, thymidylate synthetase; IMPDH, inosine-monophosphate dehydrogenase. Abbre- 
viations used for enzymes involved in salvage pathways: TK, thymidine kinase; ADA, adenosine 
deaminase; PNP, purine nucleoside kinase; AK, adenosine kinase; APRT, adenosine phosphori- 
bosyltransferae; HGPRT, hypoxanthine-guanine phosphoribosyltransferase; XGPT, £ coli xan- 
thine-guanine phosphoribosyltransferase. Other abbreviations: FH, tetrahydrofolate; xyl A, 
9-p-D-xylofuranosyl adenine. This figure has been adapted from Kaufman (1987). 
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genes through coamplification by selection for 
methotrexate resistance. 

EXPRESSION VECTORS 

Although a wide variety of expression vec- 
tors have been described over the past 5 years, 
it is difficult to compare results from different 
vectors used in different laboratories with dif- 
ferent inserts. Most mammalian cell expression 
vectors are designed to accomodate cDNAs 
rather than large genomic fragments because 
the small size of cDNA clones makes them 
more convenient to manipulate. Today most 
useful vectors contain multiple elements in- 
cluding: (1) an SV40 origin of replication for 
amplification to high copy number in COS 
monkey cells, (2 ) an efficient promoter element 
for high-level transcription initiation, (3) 
mRNA processing signals such as mRNA 
cleavage and polyadenylation sequences, and 
frequently intervening sequences as well, (4) 
polylinkers containing multiple restriction en- 
donucleases sites for insertion of foreign DNA, 
(5) selectable markers that can be used to select 
cells that have stably integrated the plasmid 
DNA, and (6) plasmid backbone sequences to 
permit propagation in bacterial cells. 

In addition to the previously mentioned 
properties, most vectors also contain an induc- 
ible expression system that is regulated by an 
external stimulus. Sequences from a number of 
promoters that are required for induced tran- 
scription have been identified and engineered 
into expression vectors to obtain inducible ex- 
pression. Several useful inducible vectors have 
been based on induction by P-interferon, heat- 
shock, heavy metal ions, and steroids such as 
glucocorticoids ( Kaufman, 1 990b). If the effect 
of an expressed protein on a particular cellular 
process is being studied, it is important to de- 
termine that the inducing stimulus does not 
interfere with that cellular process. It is also 
important to know the factor of induction (i.e., 
the difference between the basal and induced 
level of expressed protein) as well as the maxi- 
mal achievable expression level. In many cases, 
the factor of induction may be large but the 
maximal level of expression very low com- 
pared to expression from a strong constitutive 
promoter. 

CHOICE OF EXPRESSION 
SYSTEM 

In evaluating which approach to take in 
expressing a gene, it is most important to con- 




sider the goals of the expression work. If ex- 
pression is required to demonstrate that a clone 
has some functional activity or to characterize 
this activity, then transient expression in COS 
cells is often the most convenient approach. If 
a large quantity of protein (>1 mg) is required, 
then stable coamplification in CHO cells is 
generally the most desirable approach. If the 
gene is potentially cytotoxic, high-level expres- 
sion may be approached through vaccinia virus 
vectors or inducible promoter-vector systems. 
If there is a particular requirement for the host 
to produce the protein properly then that re- 
quirement will dictate the choice of the host. It 
is unusual that proteins do require host-specific 
posttranslational modifications. However, if 
this does occur, it is usually desirable to study 
expression of the gene in a variety of host cells, 
and a retrovirus vector would be the system of 
choice. 

TROUBLESHOOTING 

If protein expression from the heterologous 
gene cannot be detected, it is important to 
examine the vector system in detail. In this 
sequence, each point should be satisfactorily 
addressed before proceeding to the next step. 

1. Confirm the expected structure of the 
vector using restriction mapping (units 3. i & 3.2) 
and, if necessary, DNA sequencing (units 7.4 
& 7.5). 

2. Determine transfection efficiency by in- 
cluding a positive control — e.g., the same vec- 
tor with another insert. 

3. Ensure that the RNA is of the expected 
size and amount compared to an appropriate 
control by preparing RNA (units 4. 1.4.2. & 9.8) and 
analyzing it by northern hybridization (unit4.9). 

4. Use a completely different expression 
vector or system (units I6.u-i6.i8) if the RNA 
transcript of the correct size cannot be detected 
in the transfected cells, as it is always possible 
that some unforeseen situation may result in 
aberrant splicing (Wise et al., 1989). 

5 . Determine if the coding region may con- 
tain a point mutation or other lesion that keeps 
it from encoding a full-length protein by carry- 
ing out in vitro translation to produce protein 
(unit 10 n) using mRNA isolated from trans- 
fected cells and using RNA transcribed by in 
vitro transcription (i.e., SP6; unit3.8) of a vector 
containing the cDNA insert. 
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Transient Expression of Proteins 
Using COS Cells 

This unit describes the use of COS cells to efficiently produce a desired protein in a short 
period of time. These cells express high levels of the S V40 large tumor (T) antigen, which 
is necessary to initiate viral DNA replication at the SV40 origin. Three factors contribute 
to make COS cell expression systems appropriate for the high-level, short-term expression 
of proteins: (1) the high copy number achieved by SV40 origin-containing plasmids in 
COS cells 48 hr posttransfection, (2) the availability of good COS cell expression/shuttle 
vectors, and (3) the availability of simple methods for the efficient transfection of COS 
cells. Each COS cell transfected with DNA encoding a cell-surface antigen (in the 
appropriate vector) or cytoplasmic protein will express several thousand to several 
hundred thousand copies of the protein 72 hr posttransfection. If the transfected DNA 
encodes a secreted protein, up to 10 |ig of protein can be recovered from the supernatant 
of the transfected COS cells 1 week posttransfection. COS cell transient expression 
systems have also been used to screen cDNA libraries, to isolate cDNAs encoding 
cell-surface proteins, secreted proteins, and DNA binding proteins, and to test protein 
expression vectors rapidly prior to the preparation of stable cell lines (unit 9.5), 

This transfection protocol is a modification of that presented in unit 9.2 and gives 
conditions for optimal transfection of COS cells (see unit 9.9 for additional details). The 
main difference between this procedure and that in unit 9.2 is the composition of the 
DEAE-dextran/chloroquine solution, which is prepared here in PBS, not TBS, and 
contains chloroquine to prevent the acidification of endosomes presumed to carry the 
DEAE-dextran/DNA into the cell. (This acidification results in acid hydrolysis of the 
DNA, giving rise to mutations and destruction of the DNA.) With this protocol, 40% to 
70% of the cells can be routinely transfected. 

Materials 

Appropriate vector (see commentary) 

COS-7 cells to be transfected (see critical parameters; ATCC #CRL1651) 
Dulbeccos minimum essential medium with 10% calf serum (DMEM-10 CS; 

UNIT 9,4) 

DMEM with 10% NuSerum (Collaborative Research #55000) (DMEM-10 NS; 

UNIT 9.4), 37°C 

Phosphate-buffered saline (PBS; appendix 2) 

DEAE-dextran/chloroquine solution: PBS containing 10 mg/ml DEAE-dextran 

(Sigma #D9885) + 2.5 mM chloroquine (Sigma #C6628) 
10% dimethyl sulfoxide (DMSO; Sigma #D5879) in PBS 
0.5 mM EDTA in PBS 

100-mm tissue culture dishes 
Humidified 37°C, 6% C0 2 incubator 
Phase-contrast microscope 
Sorvall RT-6000B rotor (or equivalent) 

Additional reagents and equipment for subcloning DNA (unit 3.16), preparing 
miniprep DNA (untti.6), and purifying DNA by CsCl/ethidium bromide 
equilibrium centrifugation (units 1.7 & 9.1) 

1. Subclone the gene of interest into the appropriate vector to obtain the desired 
recombinant DNA. Purify the recombinant DNA by a miniprep procedure (5-ml 
culture) or by CsCl/ethidium bromide centrifugation. 
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2. Seed COS-7 cells in DMEM-10 CS at -20% confluence per 100- mm dish the day 
prior to transfection (so they will be -50% confluent the next day). Grow cells 
overnight in a C0 2 incubator (6% C0 2 ) at 37°C to -50% confluence. 

A confluent dish of COS-7 cells cells) is usually split 1:5 on the day prior to 

transfection to give 2 > 10 5 cells/100-mm dish in 10 ml of DMEM-10 CS. 

3. Just before use (for each 100-mm dish of COS cells to be transfected), thoroughly 
mix 5 ml of 37°C DMEM-10 NS with 0.2 ml of DEAE-dextran/chloroquine solution. 
Add 5 to 10 |ig recombinant DNA and mix. 

It is important that the DEAE-dextran be well mixed with the media before adding the 
DNA — otherwise, the DNA, a negatively charged polymer, will form large precipitates with 
DEAE-dextran, a positively charged polymer. These large precipitates cannot be taken up 
by the cell, resulting in a reduced transfection efficiency. When larger dishes are used, the 
amount of mediurn/DEAE/DNA should be sufficient to easily cover the cells and should 
include 400 fig/ml DEAE-dextran, 100 fiM chloroquine, and 1 to 2 p.g/ml DNA in 
DMEM -10 NS. 

Either CsCl-purified (UNIT 1.7) or miniprep (UNIT 1.6) plasmid DNA can be used for the 
transfections. If miniprep DNA is used, use l /$ of the miniprep per transfection. DNA can 
also be prepared as described in the second support protocol of UNIT 9.1. 

4. Aspirate medium from COS cells and for each 100-mm dish, add DMEM-10 
NS/DEAE-dextran/DNA prepared in step 3. Incubate cells 3 to 4 hr in a C0 2 incubator 
at 37°C. Observe the cells using a phase-contrast microscope. 

The DEAE-dextran will cause cells to retract and become vacuolated. Efficiency of 
transfection increases with longer incubation periods; on the other hand, so does cell death. 
The 3- to 4-hr incubation suggested here is a good starting point. However, several time 
points should be tried to optimize transfection of the particular population of cells used. 

5. Aspirate DMEM/DEAE-dextran/DNA and add 5 ml of 10% DMSO (prepared in 
PBS). Incubate cells 2 min at room temperature. 

The DMSO shock results in increased transfection efficiencies. Without this step, transfec- 
tion efficiencies might be lower by a factor of two or more, 

6. Aspirate DMSO and add 10 ml DMEM-10 CS. Grow cells overnight (12 to 20 hr) in 
a C0 2 incubator at 37°C. 

7. Passage ( split and replate) each 100-mm dish of transfected COS cells into two new 
100-mm dishes. Grow cells in a C0 2 incubator at 37°C as described in step 8a or 8b. 

After transfection, the COS cells will look unhealthy. Passaging them the day after 
transfection facilitates recovery, resulting in better levels of protein expression. In addition, 
DEAE-dextran treatment makes the cells sticky, and passaging the cells the morning after 
transfection restores their adhesion characteristics so that they may be once again lifted 
by a gentle treatment with PBS and EDTA (see step 8b). 

8a. When expressing secreted proteins, add 5 ml DMEM-10 CS 96 hr (4 days) after 
completing step 7 and incubate 4 days. Harvest the medium, remove dead cells and 
debris by centrifuging 10 min in a Sorvall RT-6000B rotor at -2000 rpm (-1000 x 
g) y room temperature, and save the supernatant (see anticipated results). Detect 
secreted proteins by metabolic labeling (unit io.is) and immunoprecipitation 
(unit 10.16), immunoaffinity chromatography (unit io.ii), radioimmunoassay 
(unit n.16), western blotting (unit w.s), or bioassay (unit 9.5). 

Do not aspirate the old medium prior to addition of 5 ml DMEM-10 CS because this 
medium contains the secreted protein. Addition of extra medium 96 hr posttransfection 
results in better yield of expressed protein; however, it also increases the level of total 
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protein (since the medium contains 10% serum), which could complicate protein purifica- 
tion To eliminate this problem, COS cells can be placed in serum-free medium 10 to J 2 hr 
after they have been replated although (in our hands) this results in a 10-fold lower yield 
of expressed protein than in the presence of serum. Thus, unless it is absolutely necessary 
to remove additional contaminating protein, serum should be present in the medium even 
at reduced levels (1% to 27c). 

8b. When expressing cell-surface or intracellular proteins, aspirate medium from cells 
72 hr (3 days) after transfection in step 6. Add 5 ml PBS, swirl, and aspirate PBS. 
Add 5 ml of 0.5 mM EDTA in PBS and incubate 15 min in a C0 2 incubator at 37°C. 
Lift cells from the dish by gently dislodging them with a Pasteur pipet. Stain 
cell-surface proteins with the appropriate fluorescent antibody and detect by micros- 
copy or flow cytometry (Yokoyama, 1991). 

Transfected COS cells will tend to clump when lifted from the dish. Pipetting the cells up 
and down will tend to disrupt these clumps. More effective dispersion of the clumps can be 
obtained by forcing the cells through a 100-fiM nylon mesh. 



COMMENTARY 
Background Information 

COS cells 

COS cells are African green monkey kidney 
cells (CV-1) that have been transformed with 
an origin-defective SV40 virus, which has in- 
tegrated into COS cell chromosomal DNA. 
Therefore, COS cells produce wild-type SV40 
large T antigen but no viral particles. Since 
SV40 large T antigen is the only viral protein 
required in trans (i.e., its coding sequence need 
not be located on the DNA molecule on which 
it acts) for viral replication, SV40 origin-con- 
taining plasmids replicate in these cells to a 
high copy number (10,000 to 100,000 cop- 
ies/cell) 48 hr posttransfection. If the plasmid 
carries a cDNA or genomic insert encoding a 
desired protein (under the control of the appro- 
priate promoter), COS cells will express the 
protein at relatively high levels over a short 
period of time. Transfected COS cells produce 
protein in a burst that starts -24 hr posttrans- 
fection and can last for up to a week. However, 
due to the excessive burden placed on the trans- 
fected cell by the replicating plasmid and the 
high levels of protein production, the trans- 
fected cells typically either die or lose the 
plasmid a week after transfection. 

COS cells were developed by Yakov 
Gluzman (1981) as a host for the propagation 
of SV40 virus early-region mutants. The first 
SV40 origin-containing plasmids to be used in 
conjunction with COS cells were made by 
Lusky and Botchan (1981). Short-term expres- 
sion systems using both COS cells and SV40 
origin-containing plasmids were initially used 
to identify DNA sequences required for tran- 



scription of the human a 1 -globin gene (Mellon 
et al., 1981). 

COS cells were first used to produce cell- 
surface and secreted proteins by Rose and 
Bergmann (1982), who looked at the expres- 
sion of wild-type and mutant vesicular stoma- 
titis virus glycoprotein in transfected cells. This 
technique was subsequently used to study the 
expression of insulin (Laub and Rutter, 1983), 
somatostatin (Warren and Shields, 1984), and 
acetylcholine receptors (Mishina et al., 1984). 
These experiments demonstrated that COS 
cells could be used to express biologically ac- 
tive cell-surface and secreted proteins. Further- 
more, these proteins were correctly processed 
although they are normally not produced by 
COS cells. 

COS cell expression was initially used to 
screen a cDNA library to isolate a cDNA en- 
coding human granulocyte/macrophage col- 
ony-stimulating factor (Wong et al., 1 985). This 
was subsequently extended to isolation of 
cDNAs encoding cell-surface proteins (Seed 
and Aruffo, 1987; Aruffo and Seed, 1987a) and 
DNA-binding proteins (Tsai et al., 1989). 

The expressed protein produced in COS 
cells is, in most cases, biologically active. How- 
ever, although COS cells are able to carry out 
some posttranslational modifications, they may 
not modify the expressed protein in exactly the 
same way as the cell that would normally pro- 
duce it. For example, COS cells do not express 
the a-( 1 ,3 )fucosy ltransferase which is capable 
of transferring fucose to either sialyl or asialyl 
precursors (Goelz et ah, 1990). In addition, 
insufficient posttranslational modification oc- 
curs in the case of lymphocyte cell-surface 
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proteins, winch tend to be underglycosylated in 
COS cells (ArulTo and Seed, 1987b). This 
might be due to an overburdening of the COS 
cell glycosylation machinery by the high levels 
of protein expression and/or by the lack of 
enzymes required to carry out the full postrad- 
iational modifications. 

An alternative to COS cells is provided by 
WOP cells (Dailey and Basilico, 1985), which 
are mouse 3T3 cells transformed by an origin- 
defective polyoma virus. Like COS cells, they 
produce no viral particles. However, they pro- 
duce polyoma large T antigen and are therefore 
capable of replicating a plasmid containing a 
polyoma origin of replication to a copy number 
that is typically ten times lower than that ob- 
tained in COS cells. WOP cells are also more 
delicate than COS cells, making them harder to 
transfect. For these two reasons, COS cells 
should be used whenever possible. However, 
WOP cells should be used in those cases where 
a monoclonal antibody which is used to identify 
and/or purify the protein being expressed cross- 
reacts with COS cell proteins. This situation 
has arisen when transient expression in COS 
cells was carried out in order to clone a human 




protein witli a mouse anti-human monoclonal 
antibody that also recognizes the equivalent 
monkey protein (Seed et ai., 1987). In this 
situation, the mouse cell line presented a useful 
alternative to COS cells by avoiding mono- 
clonal antibody cross-reactivity. 

Vectors 

The main requirements of any COS cell 
expression/shuttle vector are: (1) an SV40- 
derived origin of replication, (2) appropriate 
eukaryotic transcription regulatory elements 
(i.e., enhancer, promoter, and polyadenyla- 
tion signal sequences), (3) a prokaryotic 
origin of replication, and (4) a prokaryotic 
genetic marker for selection in Escherichia 
coli. A particularly useful example of such a 
vector is CDM8 (Fig. 16.13.1 ; Seed, 1987 ). The 
eukaryotic transcription element of CDM8 is 
composed of the cytomegalovirus (CMV) en- 
hancer-promoter, with an SV40 virus-derived 
intron and polyadenylation signal; the CMV 
promoter is a cis element (i.e., one that is 
located adjacent to the DNA it acts on) that 
directs transcription of the DNA subcloned 
downstream from it. The prokaryotic genetic 
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Figure 16.13.1 CDM8. CDM8 (Seed, 1987) is a COS cell expression/shuttle vector that con- 
tains an SV40-derived origin of replication (SV40 ori), eukaryotic transcription regulatory elements 
[splice and poly(A) + ], a prokaryotic origin of replication (tivx; derived from the pBR322 ori), and a 
prokaryotic genetic marker (supF). In addition, CDM8 contains an M13 origin of replication (M13 
ori), a T7 RNA polymerase promoter (p C mvar). and a polyoma-derived origin of replication (Py ori). 
Any restriction endonuclease sites shown can be used for cloning, but the inserted fragment must 
have its 5' end nearest the enchancer Pcmim- Tne stuffer sequence is used to detect a size 
difference after a restriction digest. 
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marker in CDM8 is provided by the supF (am- 
ber suppressor) gene. CDM8 is propagated in 
host bacterial cells containing helper plasmid 
P3, which contains amber mutations in the 
genes responsible for tetracycline and ain- 
picillin resistance (unit i a). (P3 has been intro- 
duced into many E. coli strains.) When CDM8 
is transformed into an E. coli strain containing 
P3 (unit i x), the amber mutations are sup- 
pressed, rendering the host resistant to tetracy- 
cline and ampicillin. In addition to these ele- 
ments, the CDM8 expression/shuttle vector 
contains an M13 origin of replication so that it 
can be used for the production of single- 
stranded DNA ( unitl 75), a T7 RNA polymerase 
promoter for preparation of mRNA in vitro 
(unit 3.x), and a polyoma virus-derived origin 
of replication which permits plasmid replica- 
tion in WOP cells. 

Other vectors commonly used for COS cell 
transient expression include pXM (Yang et al., 
1986) and pDC201 (Sims et al., 1988). These 
two plasmids contain the adenovirus-2 major 
late promoter and tripartite mRNA leader. This 
element acts in conjunction with the adenovirus 
VA RNA (also produced by the vectors) to 
increase the translatability of the mRNA encod- 
ing the desired protein. It is thought that ade- 
novirus VA RNAs increase translation effi- 
ciency of mRNAs containing the major late 
promoter tripartite leader by facilitating the 
interaction between mRNA and a 43S ribo- 
somal protein translation preinitation complex 
(Kaufman, 1985). 

Critical Parameters 

Efficiency of transfection depends critically 
on the length of time that the cells (COS or other 
cells) are incubated in the presence of DEAE- 
dextran/DNA. Longer periods of time result in 
higher transfection efficiencies. However, the 
DEAE-dextran/chloroquine solution is quite 
toxic to cells and in general, cells should not be 
in its presence for >4 hr. In the past, DEAE- 
dextran transfections were carried out in the 
absence of serum, because a precipitate of un- 
known composition that seemed to be very 
toxic formed in DEAE-dextran/calf serum mix- 
tures. Medium containing 10% NuSerum, on 
the other hand, does not form this precipitate 
and tends to enhance the ability of the cells to 
tolerate DEAE-dextran; thus NuSerum should 
always be included in transfection medium. 

Efficiency of transfection can also be af- 
fected by the quality of the DNA and the age 
of the DEAE-dextran/chloroquine solution. It 
is preferable to use CsCl-purified or other 



highly purified DNA whenever possible (unit 
1.7). However, miniprep DNA (unit i. 6) or DNA 
purified using a pZ523 column (5'-*3') or by 
other methods can also be used. The DEAE- 
dextran chloroquine solution can be kept at 4°C 
for several months but it is wise to prepare it 
fresh about every 3 months. 

COS cells can be obtained from the Ameri- 
can Type Culture Collection; several sublines 
exist including COS- 1 and COS-7. The COS-7 
subline is recommended because it produces a 
higher plasmid copy number. These cells grow 
as a monolayer in DMEM-10 CS in a humidi- 
fied 37°C, 5% C0 2 incubator; however, the 
ATCC grows its COS cells in DMEM-10 fetal 
calf serum (FCS). Since ECS is significantly 
more expensive than CS, changing the growth 
medium is worthwhile. The change from FBS 
to CS should be done slowly over 1 to 2 weeks. 

Because the growth characteristics, trans- 
ferability, and protein expression properties of 
COS cells change with time and with repeated 
subculturing, and because these changes tend 
not to favor the production of high levels of 
proteins, it is prudent to freeze aliquots of the 
original COS cell stock in DMEM-10 CS/10% 
DMSO for later use in a -70°C freezer for 24 
hr and then transfer them to a -150°C (liquid 
nitrogen) freezer. COS cells grow rapidly re- 
quiring passage every 4 to 5 days; typically, a 
confluent plate of cells is split 1 to 10. 

To obtain good levels of transient protein 
production from transfected COS cells, it is 
very important to replate the transfected cells 
onto new dishes with fresh medium the morn- 
ing after tranfection. In addition to enhancing 
protein production, replating the transfected 
cells allows lifting of the cells from the dish 
using only PBS/EDTA (without trypsin). This 
is very important when transient expression by 
COS cells is used to produce cell-surface pro- 
teins. 

Anticipated Results 

The basic protocol should yield transfection 
efficiencies of 40% to 70%. When COS cells 
are being used to produce cell-surface or intra- 
cellular proteins, it can be expected that each 
transfected cell will express several thousand 
copies of this protein (10,000 to 100,000 cop- 
ies/cell) 72 hr posttransfection. If COS cells are 
used to produce secreted proteins, up to 1 p.g/ml 
of protein can be recovered from the super- 
natant of a 100-mm dish of transfected cells 1 
week posttransfection. However, the amount of 
protein produced by COS cells can vary dra- 
matically depending on the protein being pro- 
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duced. This was the case when COS cells were 
used to produce soluble immunoglobulin fu- 
sion forms of cell-surface proteins (Aruffo et 
al., 1990). In this case, one of the fusion pro- 
teins, CD8 immunoglobulin, was secreted from 
COS cells at high levels (1 |ig/ml) while the 
other, CD44 immunoglobulin, was secreted 
very poorly if at all. It was found that the CD44 
fusion protein was sequestered inside the cell. 
To obtain efficient secretion of the CD44 fusion 
protein, it was necessary to change the amino- 
terminal signal sequence of the CD44 fusion 
protein. Interestingly, the native cell-surface 
forms of both CD8 and CD44 are expressed 
equally efficiently on the surface of transfected 
COS cells. For some of these immunoglobulin 
fusion proteins, 0.5 ml of medium contained 
plenty of protein (-500 ng) after concentration 
using a protein A-affinity matrix. In some 
cases, it is possible to use such COS cell super- 
natants directly without further purification 
(Aruffo et al., 1990). 

Time Considerations 

It is important not to transfect the cells too 
soon after replating; >8 to 12 hr should pass 
between the time the cells are seeded on the 
plate and the time of transfection. Once the 
transfection has started, the DEAE-dex- 
tran/DNA mixture should be left on the cells 
for a minimum of 2 hr and a maximum of 4 hr; 
because the mixture increases transfection ef- 
ficiency it should remain in contact with the 
cells as long as they appear viable. After trans- 
fection, the cells look quite unhealthy and 12 
to 24 hr posttransfection, they should be re- 
plated in new dishes with fresh medium. 

The peak of plasmid replication in trans- 
fected COS cells occurs 48 to 72 hr posttrans- 
fection. Protein production starts 24 hr post- 
transfection but peaks 72 to 96 hr posttransfec- 
tion. Thus, when expressing cell-surface or 
cytoplasmic proteins, the cells should be har- 
vested 72 to 96 hr prosttransfection. However, 
transfected cells continue to produce protein for 
up to a week posttransfection and when ex- 
pressing secreted proteins, the supernatants 
should be harvested a week posttransfection. 
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This article shows that COS cells can be used as an 
efficient, short-term, mammalian epression system 
for the production of proteins. 
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Prokaryotic Expression Vectors 



pTrc 99 A Expression Vector Kit 

• For expression of proteins encoded by inserts lacking a 
start codon. 

pTrc 99 A is a derivative (1) of pKK233-2 expression vector 
with a strong trc promoter upstream of the MCS and a strong 
rrwB transcription termination signal downstream. It contains 
an Nco I site next to the trc promoter for ligation and direct 
expression of inserts possessing the start codon ATG. Nco I 
linkers provided with the kit permit ligation of inserts lacking 
a start codon and subsequent expression in all three possible 
reading frames. The kit includes the following components: 

pTrc 99 A Expression Vector 
Nco I Linker 5'-d[CAGCCATGGCTG]-3' 
Nco I Linker S'-d[AGCCATGGCT]-3' 
Na) I Linker 5'-d[GCCATGGC]-3' 
\L cob JM105 

Induction: trc promoter is inducible with 1 -5 mM 1PTG. It contains the 
trp-K region and lac UV5 -10 region separated by 17 bp. trc promoter 
is very strong; even uninduced cells may show a low level of expression. 
Expression: Nco I linker supplies ATG start codon in all three reading 
frames for expression of insert. Vector provides nbosome- binding site. 
Host(s): Supplied with E. colt JM105. Plasmid provides lac H repressor. 
Selectable marker(s): Plasmid confers resistance to 100 ug/ml ampicillin. 
Amplification: Recommended. 

Control Regions: Expression control region: trc promoter: -10: 21b-222; -35: 
193 19S, Ribosomc binding site: 255. MCS: 265-314. rmB opcron region: 5S 
rRNA r^ion: 409-528; rmB Tl terminator: 532-575; rmB T2 terminator: ,0,- 
734. N-amasc gene region: Promoter: -10: 799-804; -35: 776-78 1 Sun codon 
(ATG}: 846; Stop onion (TAA): 1704. lad* gene region: Sun codon (GTG). 3055, 
Stop codon (TGA): 4135. Plasm.d replication region: Site of replication initiation: 
246S-2467; Region necessary tor replication: 1^71-2467. 



ORDERING INFORMATION 



Product 



Quantity Code No 



Price (US $) 



p Trc 99 A Expression Vector Kit 
Companion Products 



1 kit 27-5007 01 



Code No. 



211.00 



Page 



Nco\ 



27-0971-01 



131 



Reference 

1. Amann, Ii. et aL Gene 69, 301 (1988). 
Note- For research purposes only. Commercial use requ.rcc a license from 
Beimnunverkc Aktiengeselkchafu 35S0 Marburg, Germany. 

m I GenBank Accession Number U13872. 



A TG <Ua ttc gag ctc -sgt acc cog gga tcctct iav^ ctgcagsca tgc aagctt 




pKK22V3 Expression Vector 

• For over-expression of proteins under the control of the 

strong tac promoter. 
pKK223-3 (1) contains the strong tac promoter upstream from 
the multiple cloning site and the strong rrnV> nbosomal ter- 
minator downstream for control of protein expression (2-4). 
Induction: tac promoter is inducible with 1-5 mM IPTC, Promoter is very 
strong; even uninduced cells may show a low level of expression. 
Expression: Genes containing a ribosome-binding site and ATG can be 
inserted into anv unique Site in the MCS for expression. ^ nb™>me- 
bindmg s.te on the plasmid can be utilized to express inserts if the AH, 
start codon of the insert is within 5-13 bp from the provided nbosomc- 
bindmg site (5, 6). 

Transcription terminators: miB transcription terminators stabilize the 

plasm.d by inhibiting read-through transcription initiated from the Ln 

promoter in the parent plasmid. 

Host(s): lac h strains; E. colt JM105 is recommended. 

Selectable marker(s): Plasmid confers resistance to 100 ug/ml ampialhn. 

Amplification: Recommended. 

Control Regions: Expression control region: tac promoter: -10: 50-44; 05: 72- 
67; R.bosome binding s.te: 1 1 MCS: 4552-1587 m.B opcron region: 5S rRNA 
region- 447M3S2; rmB Tl terminator: 434M305; rmB T2 terminate : 4173- 

H6 [Vlaaarnasc gene region: Promoter: -10: 4082-4077; -35: 4105^100 tan 
c«ion (ATG): 4035 R ; Stop codon (TAA): 3178. Plasm* rep .canon region: S e of 
repl.c3t.on m.t.afon: 2417-2415; Region necessary for repletion: 311 1.415. 

f tos"sJ. and Holy, A.. Pro, Nail. Acad. Sc, USA 81, 6929 (19S4). 
2 Amann, K. et aL Gene 25, 167 (1983). 



nRnFRlNG INFORMATION 1 


Product 


Quantity Cade No. 


Price (US %) 


pKK223-3 Expression Vector 


25 pg 27-4935-01 


135.00 



I / GenBank Accession Number M77749. 



y. TCCTTTGTCTTWGGGCCCCTAGGCAGCTGOACoT.-0. J TTCGAA -5' 
5-- ^GG^CAGJWrrcCCGGGGATCCGTCGAC|TT.>>.V^.AA.3CTT -3 




'S*1« not ixnqu" 



4. Jaffe, B. M. et aL Biochemistry 27, 1869 (1^8S) 

5 Koxak M., Microbial Rev. 47, 1 (1983). 

6". Cold, L„ Ann. Rev. hiochem. 57, 199 (1988). 



Eukaryotic Expression Vectors 



pSVK 3 Expression Vector 

• For in vivo expression in mammalian cell lines (1). 
The phagemid pSVK 3 is a true shuttle vector, containing both 
the colE 1 origin for replication in £. coli and the origin of 
replication, early promoter, mRNA splice site and polyadenyla- 
tion signals from SV40 for replication and expression in eu- 
karyotic cells. The presence of the fl origin of replication per- 
mits single-stranded phage production following infection of 
bacterial cultures with the helper phage M13K07. 

Expression: Expression is controlled by the unmducible SV40 early 
promoter. Expression is transient, but can be stable if the gene insert 
is selectable. 

Sequencing: Both double-stranded and single-stranded [{+) strand] 
sequencing are possible. A protocol for production of single-stranded 
DNA is provided with the vector. 

In vitro transcription: A T7 RNA polymerase promoter allows for in vitro 
transcription of cloned inserts. 

Host(s): E. coli and mammalian cells (CV-1, COS and HeLa cells). 
Selectable markers ): Plasmid confers resistance to 100 ug/ml ampicillin 
in E. colt. 

Control Regions: SV40 origin and early promoter region: Minimal region for 
replication: 239-314, TATA box: 252-258; 21 -bp repeats 170-232, 72-hp repeats: 
23-166; Transcription start region: 279-286. MCS: 377-435. SV40 early splice 
region: Small T antigen intron: 630-695. SV40 polyadenylation region: Poly A 
signal sequence: 1331-1336; PolyA site: 1352. T7 promoter: 352-374. (3-lactamase 
gene region: Promoter region: -10: 2017-2022; -35: 1994-1999; Start codon (ATG): 
2064; Stop codon (TAA): 2922. Plasmid replication region: Site of replication 
initiation: 3683-3685; Region necessary for replication: 2989-3685. fl region: 
1478-1932; Origin of fl replication: 1770. 

Reference 

1. Mongkolsuk, S., Gene 70, 313 (1988). 



ORDERING INFORMATION 


Product 


Quantity 


Code No. 


Price (US $) 


pSVK 3 Expression Vector 


25 uq 


27-4511-01 


154.00 


Companion Products 




Code No. 


Page 


M13K07 Helper Phage 




27-1524-01 


98 



I / GenBank Acccssm >n Number Ul ^867. 



[TAATACGA'. T: « TTATAG&GAG/ >TCGAATTi"G»<X Kj GGTACq 

i 



t ' r -.■--K)ter 

CGGGGAI-.-. r AGAGTtXiACCTGCAGG'VVy C'CGAGCTC 




Antigen Spica Site 



BamH 1(1401) 
Pvul(t433)' 
Bgll(l459}' 



pSVL SV40 Late Promoter Expression Vector 

• For high-level transient expression in eukaryotic cells. 
Expression: Genes inserted into the multiple cloning site are transiently 
expressed from the first ATG using the SV40 late promoter (1,2). Trans- 
cripts are spliced and poiyadenylated using the SV40 VP1 processing sig- 
nals. Absence of the pBR322 "poison sequence" results in higher plasmid 
copy number and higher level of transient expression in COS cells (3). 
Host(s): Any suitable restriction -minus £. coli strain for amplification and 
cloning; mammalian cells including COS (3), CV1, HeLa and 3T3 cells. 
Highest level of eukaryotic expression is found in T antigen -producing 
cells, such as COS. 

Selectable marker(s): Plasmid confers resistance to 100 ug/mi ampicillin 
in E. coli. Stable transformants in eukaryotes can be isolated if the 
inserted gene is selectable. 
Amplification: Recommended. 

Control Regions: SV40 origin and late promoter region: Minimal region for 
replication; 4824^*834; 21-bp repeats: 41-103; 72-bp repeats: 107-250, Major 
transcription start site: 325. VP1 region: VP1 leader sequence: 323-525, VP1 
intron: 526-1493 MCS: 1506-1535. SV40 polyadenylation region: PolyA signal 
sequence: 1654-1659, PolyA site: 1671. p-lactamase gene region: Promoter region: 
-35: 4230^225; -10: 4207-4202; Start codon (ATG) 4160; Stop codon (TAA): 
3302. Plasmid replication region: Site of replication initiation: 2541-2539; Region 
necessary for replication: 3235-2539. 

References 

1. Sprague, J. et ai, /. Virology 45, 773 (1983). 

2. Templeton, D. and Eckhart, W, Moi Cell. Biol 4, 817 (1984). 

3. Lusky, M. and Botchan, M., Nature 293, 79 (1981). 



ORDERING INFORMATION 



Product 



Quantity Code No. 



pSVL SV40 Late Promoter 
Expression Vector 



25 ug 27-4509-01 



1/ GenBank Accession Number U13«St>8. 



1506 15-35 
I I 
CTCGAiji.. TAGACCCGGGGAGCTCGGATCC 




Price (US $) 



135.00 



Pst I (36 U) 
Pvu I (3740) 



Contact us today: 



Eukaryotic Expression Vectors 



pMSG Inducible Mammalian Expression Vector 

• For inducible exf)ression in mammalian cells (1). 
Induction: Expression is controlled by the MMTV LTR promoter 
(mouse mammary tumor virus long terminal repeat) located upstream of 
the multiple cloning site (MCS). Genes inserted into the MCS are trans- 
lated from the hrst ATG of the insert and expressed from the promoter, 
which is glucocorticoid -inducible (0.1 uM dexamethasone). 
Selection in eukaryotes: pMSG contains the E. colt xanthine -guanine 
phosphon bosyltransferase {gpt) gene which is expressed from the SV40 
early promoter and therefore provides a marker for selecting stable 
transformants in eukaryotic cells (1-3). Cells with a functional^ gent- 
are resistant to mycophenolic acid. 

Stable expression: Cells must be hormonally responsive (1-6) for expres- 
sion from the MMTV promoter. pMSG will integrate randomly into the 
host genome under selective pressure oi mycophenolic acid. 
Transient expression: After transfection, transient expression is inducible 
with dexamcthasone. Cells must be hormonally responsive (1-6) for 
expression from the MMTV promoter. 

Host(s): ret: A' E. colt strains such as HR101 (Note: pMSG contains 
regions of internal homology; use with some bacterial strains may lead to 
deletions in the vector.) Mammalian cells including Chinese hamster 
ovary (< HO} (1 WcU cells (3) and mouse Ltk" (4) and 3T6 (5). For 
induction, cells must be responsive to glucocorticoid compounds. 
Selectable marker(s): Plasmid confers resistance to 100 ug/ml ampicillin 
in /,. coli. Stably transformed mammalian cells are resistant to mvco- 
phenolic acid. 

Amplification: Recommended. 

Control Regions: MCS: 1-23. MMTV LTR region: LTR: 6178-7442; Hormone 
response element: 7122-7306; Approximate site for transenprion initiation: 7358. 
SV40 carlv splice region: Small T antigen intron: 101-165. SV40 polyadcnylarion 
region: Poly A signal sequence: 801-806; PoIyA site: 822. SV40 origin and early 
promoter regit >n: Minimal region for replication: 1116-1191; TATA box: 1 129-1 135; 
21 -bp repeats: 1 04^-1 109; 72-bp repeats: 900-1043; Transcription initiation region: 
1 1 56-1 163. &>t gene region: gf>t gene: 1218-2279; Start codon (ATG): 1416, Stop 
codon (TAA): 1872. SV40 early splice region: Small T antigen intron: 2351-2415. 
SV40 poly ademption region: PolyA signal sequence: 3051-3056; PolyA site: 3072. 
fVlactamase gene region: Promoter region -10: 40364041; -35: 40134018; Start 
codon {ATG): 4083; Stop codon (TAA): 4941. Plasmid replication region: Site of 
replication initiation: 5702-5704; Region necessary for replication: 5008-5704. 



ORDERING INFORMATION 



Product Quantity Code No. Price (US $) 

pMSG Inducible Mammalian 25 ug 27-4506-01 135.00 

Expression Vector 



References 

1. Lee, F. er al, Nature 2<M, 228 (1981 ). 

2. Mulligan, R. and rW^, P., Proc. Natl A<jA. Scl USA 78, 2072 
(1981). 

3. Klessig, 1). F. etaL At../. Cell. Biol. 4, HS4 (P84). 

4. Hall, C. V. etalj. MlAppl Gen. 2, 1<>] (PS}). 

5. Chapman, A. B. et al . Mol Cell. Biol. 3,1421(1983). 

6. Morley, K. L. et J . Nucl. Acids Res. 15, (1987). 

■ / GenBank Accession Number Ul >Ke>(). 




pCHHO Eukaryotic Assay Vector 

* For monitoring and normalizing expression in eukaryotic 

cells (1). 

tApression: pCHl 10 contains a functional lacT, gene which is expressed 
from the SV40 early promoter in eukaryotes and from the E. coli gfH 
promoter in prol- aryotes (2). When co-trans fected with a second plasmid, 
pCHl 10 provides an internal marker to monitor and normalize transient 
and stahle expression between experiments in eukaryotic cells (3). 
Promoter screening: Insertion of a promoter into the Hind III site allows 
monitoring of [5-galactosidase expression (1,4). 

Host(s): h. colt and mammalian cells (including COS monkey cells, and 
mouse Ltk' cells) 

Selectable marker(s): Plasmid confers resistance to 100 ug/ml ampicillin 

in E. cob. 

Amplification: Recommended. 

Control Regions: SV40 origin and earlv promoter region: Minimal region for 
replication 7(HS-^023, TATA box: 7036-7042, 21 -bp repeats: 6954 7017- 72-bp 
repeats: 6 S 07-6 950; Transcription start region 706 3-7070. gpt gene region: gpt 
gene: 7-208; TATAA: 13-17. Start codon (ATG): 85. trpS gene region: 209-292. 

gene region: lacL gene 295-3605; Stop codon (TAA): 3 340. SV40 polv- 
adcnylation region: PolyA signal sequence. 3666-3671; PolyA site: 3b87. 
P-Uctamase gene region: Promoter region: -10 4651-4656; -35: 462X-4633; Start 
codon (ATG): 4698; Stop codon (TAA) 5556. Plasmid replication region: Site of 
replication initiation: 6317-6319; Region necessary for replication: 5623-6319. 

References 

Miller, J. H. in Experiments in Molecular Genetics (Cold Spring 

Harbor laboratory) (1972), p. 352. 
2 - Hall, C. V. etal.J. Mol Appl Gen. 2, 101 (1983). 
^ Hcrbomcl, P. et al, Cell 39, 653 (1984). 
4 L<*, K etal, Nucl Acids Res. 12, 41 ^1 (1984). 



ORDERING INFORMATION 


Product Quantity Code No 


Price (US $) 


pCH1 10 Eukaryotic Assay Vector 25 ng 27 4508-01 


135.00 



■ / GenBank Accession Number U13845. 
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Epitope Mapping of Anti-recA Protein IgGs by Region Specified 
Polymerase Chain Reaction Mutagenesis* 



i Received tor publication, October 14, 1991 \ 



Masayuki Ikeda, Kaoru Hamano, and Takehiko Shibataf 

From the Laboratory uf Microhmhi^. LIKES (The Institute <>} Lhysical and ('he»ucal Research'. Waht-shi. 
Sait ana - in! -<>I , Japan 



Monoclonal IgGs were shown to be useful for the 
specific inhibition of a set of activities of the recA 
protein, a key protein in homologous genetic recombi- 
nation. The mapping of the epitopes for these IgGs and 
site-directed mutagenesis based on the mapping will 
facilitate location of the functionally active sites on the 
tertiary structure of the protein, which is being solved 
by means of physicochemical techniques. We developed 
a novel technique for region-specified mutagenesis and 
applied the technique to epitope mapping. Using the 
polymerase chain reaction in the presence of deoxyi- 
nosine triphosphate, we introduced random base sub- 
stitutions specifically into a region of the recA gene 
defined by a pair of primers. RecA mutants exhibiting 
altered antigenicity were selected, in plaque-immuno- 
blotting experiments, from libraries of mutagenized 
recA genes constructed on the Xgtl 1 expression vector. 
Mutant recA genes were obtained at the frequency of 
about 10~ 2 among the plaques expressing fused recA 
genes and then each one was expressed as a whole 
protein, which was characterized by enzyme-linked 
immunosorbent assay. Analyzing the DNA sequences 
of the mutant recA genes, we located at the amino acid 
sequence level the epitopes for two anti-recA IgGs 
which could not be located in previous studies. One of 
the antibodies was shown to prevent self-assembly of 
the recA protein and the other was suggested to inhibit 
the binding of double-stranded DNA. Thus, the active 
sites involved in these functions would be located in 
the space around or near the relevant epitope. 



The recA protein, and its prokaryotic and virus (T4 phage) 
analogues promote "homologous pairing" and "strand ex- 
change" between homologous double-stranded and single- 
stranded DNAs through ATP for dATP- ^dependent reac- 
tions in vitro, and were shown to play an essential role in 
homologous genetic recombination in riro. Homologous pair- 
ing is the formation of an mtermolecular duplex ("heterodu- 
plex") between a couple of homologous single-stranded and 
double-stranded DNAs, and strand exchange is the processing 
of the heteroduplex, such as its elongation. Each of these 
reactions consist of a number of substeps and the recA protein 
or its analogues appear to have various active sites that 

* This research was supported by a ^rant from the Hiodesign 
Research Program from RIKKN (The Institute of Physical and 
Chemical Research). The costs of puhlication of this article were 
defrayed in part by the payment of page charges. This article must 
therefore be hereby marked "advertisement" in accordance with Is 
I'.S.C. Section 1 734 solely to indicate this fact. 

i To whom correspondence should be addressed: Laboratory of 
Microbiology, RIKKN, Wako-shi, Saitama Hal -01. Japan. Fax: Sl- 
484-62-4(571. 



promote each of these substeps, such as an ATP-binding site, 
ATPase catalytic center, binding site for single-stranded 
DNA, binding site for double-stranded DNA, and sites for 
self-polymerization. The localization of these active sites on 
the tertiary structure of the recA protein is essential for 
understanding the mechanisms of the underlying biochemical 
functions of the protein. However, only sites related to ATP- 
ase have been partly located at the amino acid sequence level. 
The mapping of mutation sites as well as x-ray crystallo- 
graphic analysis of the protein are the main means to this 
end. A series of our studies involving the use of anti-recA 
protein monoclonal IgGs is also an approach to the same goal 
(see Shibata et a/., 1991, for review). 

We have isolated clones of mouse hybridomas which pro- 
duce anti-recA protein IgGs {Makino et a/., 1985). Two 
(ARM 193 and ARM191) of these anti-recA protein IgGs each 
inhibit a set of activities of the recA protein without affecting 
the others; re. ARM 193 severely inhibits the unwinding of 
the double helix and strand exchange, but allows homologous 
pairing and single-stranded DNA-dependent ATP hydrolysis 
(Ikawa et ai, 1989; Makino et a/., 1985, 1987). On the other 
hand, ARM191 inhibits the homologous pairing and unwind- 
ing of the double helix, but only affects the single-stranded 
DNA-dependent ATP hydrolysis a little (Makino et a/., 1985). 
ARM 193 was suggested to affect the site for the interaction 
between recA polypeptides (Ikawa et a/., 1989) and ARM191 
to affect the site on the recA polypeptide for the binding to 
double -stranded DNA (Makino et a/., 1985). We preliminarily 
located the epitopes for both ARM193 and ARM191 in a C- 
terminal 88 amino acid region ( Phe^'-Glu 147 ) of the recA 
polypeptide by examining the cross-reaction of proteolytic 
fragments. However, we failed to map them more precisely, 
since none of the subfragments of the 88-amino acid region 
exhibited significant cross-reaction with either of the IgGs 
(Ikeda et ai, 1990). Therefore, it was necessary to introduce 
anot her technique to overcome the problem. Here, we describe 
a novel technique for region-specified mutagenesis and, as an 
application of this technique, the mapping of the epitopes of 
ARM 193 and ARM 191 at the amino acid sequence level in 
distinct but slightly overlapped subregions in the C-terminal 
H8- amino acid region. 

MATERIALS AND METHODS 

nrA Trotein - The purified recA protein was fraction V prepared 
as described (Shihata ft al. 19S1 ). 

Oligonucleotides Oligonucleotides were synthesized with a DNA 
synthesizer (I)u Pont -New Kngland Nuclear 0ODER300) and puri- 
fied with NKNSORB PRKP (I)u Pont -New Kngland Nuclear). 

Techniques far (laning of PSAs Treatment with restriction en- 
doruieleases and DNA ligase, and the isolation and cloning of the 
DNA fragments onto vectors were carried out as described (Herger 
and Kimmel, 19S7; Maniatis et ai. 1982). 

Antibodies and Immunochemical Techniques -The anti-recA pro- 
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Kic. 1. Region-specified PCR mutagenesis. UNA encoding 
the E coii recA gene Hanked bv primers 1 and 2 was amplified by 
PCR in the presence of dITP with the use of Taq DNA polymerase. 
EcnHl fragments of the amplified DNA which encoded the ('-terminal 
region of the recA protein were cloned on a ,\gtll expression vector 
to construct DNA libraries of the mut agnized recA genes. With 
appropriate orientation of a fragment relative to the vector, the ( - 
terminal region of the recA gene was connected to the lacZ gene in- 
frame Proteins expressed m the plaques obtained from the libraries 
were transferred to a pair of membranes and then the cross-reaction 
with either anti-recA protein IgG ARM191 or ARM 19.1 was tested. 
The plaques showing cross-reaction with only one of the IgGs Vindi- 
cated by arroirs) were picked up and subjected to further cross- 
reaction tests. The closed circles in the big cirrtes at the bottom ol the 
figure denote plaques which showed cross-reaction with the indicated 
IgG. 
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Kid. 2 Structure of P MI996 for expression of mutated recA 
genes. DNA encoding the N-terminal region of the recA polypeptide 
was put under the control of the Tac promoter on a multicopy plasmid 
(pKK223-3). EcoRI fragments of the mutagenized recA gene were 
inserted at the EcoRI site at the codons for Glu ' '-Phe " . 

tern monoclonal IgGs, ARM191. ARM193. and ARM414. were de- 
scribed previously. We used affinity purified preparations of these 
IgGs An anti-recA protein monoclonal IgG, MAblofi. was isolated 
bv Karu and Allen ( Karu and Allen, 19S2), and a purified preparation 
of this IgG was provided by Dr. Alexander Karu (University of 
California. Berkeley) and Dr. A. John Clark I University ot California, 
Berkeley). , 

The enzyme-linked immunosorbent assay (KLISA)' was carried 
out as described previously Unless otherwise stated, samples of cell- 
free extracts were diluted in PBS (50 mM potassium phosphate buffer 




1 The abbreviations used are: ELISA, enzyme-linked immunosor- 
bent assay; PBS, phosphate-buffered saline; PCR, polymerase chain 
reaction. 
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Fit, 3 Absence of competition in cross-reaction between 
anti-recA protein IgGs, ARM 191 and ARM 1 93. The wells of a 
microtiter plate were coated with the purified ret A protein. Then, a 
solution of a tested anti-recA protein (mmdilution: 1.5 ^/m\) con- 
taining the other IgG was put into the well* and allowed to cross- 
react with the recA protein. ARM191 belongs m IgG, and ARM19:- 
to MV, Thus the amounts of ARM 191 and ARM193 bound to the 
recA protein were specifically measured by ELISA with the use of an 
appropriate subclass-specific antibody A U th»- binding of ARM191 
in the absence of ARM193; ■, thr binding « >f ARM 191 in the presence 
of 5 Mg of ARMl93/ml. B: O. the binding «.f AKM193 in the absenc< 
of ARM191; the binding of ARM193 m thr presence of 5 Mg o 
ARMl91/mI. 

(p H 7 containing 150 mM NaCl) and put into the sample wells o 
a microtiter plate with 96 wells. On the other hand, the concentrate 
of the anti-recA protein IgG was first adjusted to 30 ^ of protein/m 
Cno dilution") in PBS Tween (PBS containing 0.05% Tween 20) am 
then a series of dilutions was prepared in PBS/Tween. 
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Vie.. 4. Testing of cross-reaction of proteins in plaques ob- 
tained from libraries of mutagenized DNA. The results of two 
experiments are shown as examples (A and H). Al and HI show 
cross-reaction with ARM 191, and A2 and H2 that with ARM 193. The 
procedure was described in detail in the legend to Fig. 1. Arrowheads 
indicate mutants which showed altered cross-reaction; mutant D (the 
top ones in Al and A2) shows no cross-reaction with ARM 191, and 
mutant 47 \H1 and H2) none with ARM 193. 
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Testing of the Competition between ARM 191 and A R Mil 93— The 
IgGs, ARM 191 and ARM 193, belong to subclasses 1 and 2b, respec- 
tively (Ikeda et al., 1990), and thus each could be assayed by use of 
anti- mouse I^G, and ant i- mouse IgG Jtl antibodies, respectively. A 
solution of the purified recA protein (0.2 ^g/ml) was put into the 
wells of a microtiter plate to coat the walls of the wells. Each solution 
of an indicated amount i no dilution: 1.5 ng/m\) of a tested anti-recA 
protein IgC 1 10 ^1) contained 0 or 5 ^g of the other anti-recA protein 
IgG/ml. The amounts of the tested anti-recA protein IgG that bound 
to the wells were measured by ELISA with anti -mouse IgG, or anti- 
mouse IkGi.v, antibodies l Zymed Laboratories, Inc.}. 

Region-specified PCR Mutagenesis — An outline of the procedure is 
given in Fig. 1. pBKl'14 UNA, which contains the Escherichia coli 
recA gene (Chlin and (Mark, 1981). was linearized with BamH\ and 
tised as the template for PCR (polymerase chain reaction). A DNA 
region encoding the recA gene flanked bv primer 1 (5'-ATGGCT- 
ATCGACGAAAACAA-3') and primer 2 (5'-GAATTCTGTCATG- 
(iCATATCCTT-3' ) was amplified by 25 cycles of PCR (see Fig. 1). 
I'nless otherwise stated, the reaction mixture for PCR comprised 1 
M M each of primers 1 and 2. about 3 pM the template DNA, 200 
each of dATP. dTTP. dGTP, and dCTP, 200 deoxyinosine 5'- 
triphosphate (dlTP), 0.025 units of Taq DNA polymerase/^1, 1.5 mM 
MgCl,, 50 mM KC1, 0.001 c i gelatin, and 10 mM fris-HCl buffer (pH 
8.3). Kaeh cycle of PCR consisted of: (i) incubation at 37 °C for 2 
inin. for loading of the primers onto the template DNA, (ii) incubation 
at 72 C C for 3 min, for polymerization, and (iii) incubation at 94 T 
tor 1 min. for denaturation. The amplified DNA was treated with 
AVoRI and the EcnHl fragment encoding the C-terminal 93-amino 
acid region was isolated by gel electrophoresis, followed by trapping 
on a DEAE membrane (NA45. Schleicher & Schuell). Then, DNA 
libraries containing the mutagenized AVoRI fragments were con- 
structed on the Xgt 1 1 expression vector by ligating them at an rVeoRI 
site of the vector. With appropriate orientation of a fragment relative 
to the vector, the ('-terminal region of the recA gene was connected 
to the lacZ gene in-frame. Plaques of the phages in the libraries were 
obtained with the E. coli Y1090 strain as a host. Proteins in the 
plaques were t ransferred to a pair of membranes and then the cross- 
reaction with either ARM191 or ARM 193 anti-recA protein IgG was 



examined by means ot unmTWoblotting experiments, a- described 
previously I Ikeda et ai, 1990; Morishmia et al., 1990). The plaques 
showing cross-react ion wit h only one of the Ig( Is ( indicated by arrow* 
in Fig. 1; examples are shown in Fig. D were picked up, and I he 
phages were obtained after repeated single plaque isolation and tot 
ing by means of immunoblotting experiments. 

DNA Sequence Analysis- -'Yhe tested DNAs were cloned on 
pU(T 19 and then their sequences were analyzed by the dideoxyribo 
nucleotide chain termination method (Sanger et al., 1977); the label- 
ing reaction was carried out by use of the double -stranded template 
according to a manual for Sequenase (Cnited States Biochemical Co., 
Cleveland, OH), and the products were analyzed with the use of an 
automated DNA sequence analyzer (Du Pont GENESIS2000). We 
analyzed both strands in most of the cases. 

Expression of Mutant recA Genes and Preparation of Cell- free 
Extracts— EcoHl fragments which carried mutation! s) in the C ter- 
minal 93-ammo acid region were ligated to the AVoRl site of the DNA 
region encoding the N-terminal 2G0 amino acids, which was under 
the control of the Tac promoter on a multicopy plasmid, pMl990 
(Fig. 2), a derivative of pKK223-3 (Brosius and Holy. 1984). An E. 
coli strain, MVT 184 (deletion of the srl-recA locus) (Vieira and Mess- 
ing, 1987), was transformed with the DNA. The transformants were 
grown at 37 C to the mid-logarithmic growth phase in 3 ml cultures 
and then expression of the mutated recA genes was induced by 
isopropyl-^-It-thiogalactoside treatment (0.2 mg/ml) for 2 h. After 
the treatment, 'he cells were collected by centrifugal ion and sus- 
pended in 200 >d of a lysis buffer (10 mM Tris-HCl (pH 8.0), 1 mM 
EDTA. 0 1 M NaCl, anti 5 f 7 Triton X-100), and then treated with 
lysozyme (O.b mg/ml) at 0 C C for 15 min, followed by the addition of 
KC1 (at 0.24 M) The lysate was centrifuged at about 15,000 > g for 
15 min and the supernatant of the lysate was saved. The supernatant 
was diluted about 20-fold with PBS and then subjected to ELISA as 
described above. In the case of mutants 2 and 38, the precipitates of 
the lysates were resuspended in 5 M urea containing 100 mM I ris 
HC1 (pH 7 9), followed by eentrifugat ion, and the mutant proteins 
were extracted from the precipitates with 100 mM Tris-HCl buffer 
(pH 7.9( containing 8 m urea and 0.1 M NaCl at 25 "C for 1 h. 

RKSULTS 

Testing of the Competition between ARM191 and ARM J93-- 
The preliminary mapping of the epitopes of ARM 191 and 
ARM 193 indicated that both epitopes were located between 
Phe'^' and Glu u: (Ikeda et al., 1990). First, we examined 
whether or not these anti-recA protein-IgGs showed compe- 
tition in the cross-reaction with the recA protein. Since 
ARM 191 and ARM 193 belong to different IgG subclasses, 
each can be discriminated through the use of subclass-specific 
antibodies on ELISA. Fig. 3 indicates that the presence of 
one of these IgGs did not affect the binding of the other Ig< J 
to the recA protein. We conclude that the epitope for ARM 191 
and ARM 193 are different. Thus, we tried to locate the 
epitopes of these anti-recA protein IgGs more precisely. 

Development of Region- specified PCR Mutagenesis and Iso- 
lation of Mutants—Since we were not able to locate the 
epitopes of ARM191 and ARM193 by examinations of the 
cross- react ion of subfragments of the recA polypeptide, we 
developed a novel technique for region -specified mutagenesis 
which could be applied for mapping of the epitopes. The whole 
process for the isolation of mutants causing altered cross- 
reaction consists of three stages (Fig. 1); the introduction of 
region-specified random base substitutions by means of PCR, 
construction of libraries of the mutagenized DNA with the 
use of the Agtll expression vector, and in situ testing for 
cross-reaction of the mutated polypeptides expressed in the 
plaques obtained from the libraries. In PCR (Saiki et al., 
1985), Taq DNA polymerase causes the misincorporation of 
nucleotides (Eckert and Kunkel, 1990). We added deoxyino- 
sine 5 '-triphosphate (dlTP), at 200 ^M, to the reaction mix- 
ture for PCR to enhance the misincorporation (Martin and 
Castro, 1985), and specifically amplified a DNA region defined 
by a pair of oligonucleotides (primers 1 and 2). Primer 1 
includes the initiation codon of the recA polypeptide and 
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primer 2 includes a complementary sequence about 80 bases 
downstream of its stop codon (Fig, 1). Primer 2 was designed 
so as to have an EcoRl cutting site. At more than 200 ^M, 
dITP severely inhibited the amplification ot DNA by PCR 
(data not shown). After 25 cycles of PCR, the amplified DNA 
was treated with EcoRl and the EcoRl fragment encoding the 
region of the recA polypeptide extending from Phe' 1 " to the 
C-terminal Phe was isolated. The fragment was inserted into 
the EcoRl site of the Xgtll expression vector (Young and 
Davis, 1983). The proteins in the plaques were examined as 
to their in situ cross-reaction with anti-recA protein Igds 
ARM191 and ARM 193. As shown in Fig. 4, most of the 
plaques expressing the fused recA polypeptide showed cross- 
reaction to both IgGs. We could pick up about V '< ot these 
plaques as candidates of mutants which showed cross react ion 
to only one of the IgGs (indicated by arrowheads in Fig. 4; 
Table I). 

From the selected plaques, phages were recovered and the 
inserted EcoRl fragments were reisolated. The DNA frag- 
ments were recloned on the pUC119 vector < Vieira and Mess- 
ing, 1987) and their DNA sequences were analyzed by the 
dideoxvribonucleotide chain termination method (Sanger et 
a/., 1977). We analyzed both strands in most of the cases by 
use of double-stranded form and an appropriate primer. The 
triplets which were changed by mutations are shown in Table 
II. During the development of the new technique for muta- 
genesis, we obtained 19 mutants exhibiting altered cross- 



reaction to either ARM191 or ARM193 from among the 52 
plaques tested (Table I). So far as defined, all mutations were 
the substitution of amino acid(s) caused by base substitu- 
tions) (Table II). Some of the mutants have a second (third) 
base substitution which does not affect the amino acid se- 
quence. Of the base substitutions defined, 83% are transitions 
and I7 r r transversions. 

On the first screening, the addition of dITP appeared tc 
have onlv a little effect on the PCR mutagenesis (Table I) 
However, when we characterized the mutants isolated, we 
found that the addition of dITP is essential for this mutagen- 
esis (Table I). Without dITP or with 0.2 mm dITP, we pickec 
up 5 and 21 plaques, respectively, on the first screening, but 
obtained only 3 kinds of mutants. When dITP was added ai 
200 uM, we picked up 21 plaques from among about 200( 
plaques expressing the fused recA polypeptide, and from thes< 
groups we finally isolated 11 mutants which were categorizec 
into 10 kinds. Three of them (mutants 30, 33, and 36) hat 
base substitutions at two or three sites (see Table II). Thes< 
findings indicate the significant enhancement of the yield anc 
variation on mutagenesis with dITP, and the very high yieb 
of base substitutions under an appropriate set of conditions. 

Some mutants have base substitutions at the same sites 
Except mutants 23 and 34 (substitution of Ser :m ), such over 
lapped mutants were isolated from the same PCR prepare 
tions. Thus, these overlapped mutations seem to be create 
at an early evele in PCR and amplified during the procedun 
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Fu. Variation of the antigen concentration on KLISA 
does not change the amount of Ig<i giving a half-maximum 
signal. The wells nt a micrutiter plate ui-re coated with the purified 
recA protein at the indicated com em rat i>ms. On the other hand, the 
concentration of the anti-recA protein k<i (AKM191 in A\ ARM 193 
in H) was adjusted m M) ^/ml r nondilut ion"), and then a series of 
dilutions of the was put in'o indi\ idual wells. The bound IgG 
was measured by KLISA. The concentrations of the recA protein 
were as follows: [.1. 2 ut; ml; O. 0 2 pg, nil. A, 0.1 ^<:/ml; A. 0.0f> ^/ 
nil; ■, 0.02 jig/ml; •. without ret A protein. 

Except for in these cases, base subst itutions appeared to be 
introduced at random in the amplified DNA. 

Cross-reaction of Mutant recA Pnrteins—We constructed a 
plasmid (pMI996; Fig. 2) in which the wild type recA gene 
was under the control of the Tac promoter on a multicopy 
plasmid (a derivative of pKK223-3 (Brosius and Holy, 1984). 
We replaced the EcoRl-EcoRl region encoding the C -terminal 
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Fi<;. 0 Examples of ELISA for testing the cross- reaction of 
mutant recA proteins to anti-recA protein IgGs, ARM 191, 
ARM 193, and MAblf>6. Cell -free extracts were prepared from 
cells in which the mutant rvcA genes were expressed; A, wild type; B, 
mutant 3S; (\ mutant I), mutant 31. The cell-free extracts were 
put into individual wells of a mierotiter plate to allow the adsorption 
of proteins. Then, the cross react ion to each of the IgGs was examined 
by KLISA. as described in the legend to Fig. 5. The IgGs used were: 
• ARM 191; O. ARM 1 9:1; A. MAbloO. Anti-recA protein IgG, 
ARM tl 4. of which epitopes is located between Glir 11 and Lys'''' 1 
i Ikeda ft aL, 1990> was used as a positive control ( + ). 

93 amino acids of the wild type recA polypeptide with the 
rVcoRI fragments on which we found mutation(s) (Fig. 2). 
Then, a mutant of E. coli in which the whole recA gene was 
deleteci was transformed with these plasmids. The expression 
of the mutant recA genes was induced and cell- free extracts 
were prepared, followed by quantitative assaying (ELISA) for 
cross- react ion with ARM 191 and ARM 193. For comparison 
of the extent of cross- react ion of a tested IgG with the recA 
protein, we determined the amount of the IgG giving a half- 
maximum signal on ELISA. As shown in Fig. 5, variations in 
the amount of recA protein did not significantly change the 
amount of the IgG which gave the half-maximum signal on 
ELISA. We calculated the amounts of ARM191 and ARM193 
giving a half-maximum signal to be 0.0277 {a = 0.017) and 
0.29 {a — 0.10) /ig/ml, respectively, from the data in this 
figure. 
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Fu;. 7. Mutation sites and alterations in cross-reactivity. 

The numbers under the abscissa are the amino acid position.- from 
ihe N terminus of the recA polypeptide. Cross-react ivity is drtmcd 
as follows: (the amount of an I^'l giving the halt-maximum value on 
KLISA with wild-tvpe recA protein )/(the amount o! an -mr^ 
the half-maximum value on KLISA with mutant recA protein) 1 bus. 
it shows the deficiency in the cross-reaction. •. cros> reaction with 
ARM191; O, that with ARMIS* A, that with MAhl--»(i. Symbol* 
within the shadowed area indicate that no cross reaction was detected 
(within the limit of the measurement, which is indicated by each 
sxmbol). The symbols with a horizontal arrow around position W'li) are 
those for a mutant having two amino acid substitutions at positions 
H19 and 323, and those with * at position 330 are those tor a mutant 
having two amino acid substitutions at positions '1\W and 'XM\. 

Examples of ELISA with mutant recA proteins as well as 
the wild type protein are shown in Fig. f>. The values calcu- 
lated on ELISA are listed in Table II. In order to obtain the 
half-maximum signals on ELISA, all proteins (except mutant 
38) which had been determined to be mutants by nnmuno- 
blotting experiments were shown to require at least 10-fold 
more IgG ARM191 or ARM193 than the average for plus 
proteins (Fig. 7). The results shown in Fig. 7 clearly indicate 
that the mutants affecting the cross-reaction with ARM 191 
and those affecting that with ARM193 are separately clus- 
tered, and only slightly overlap each other (between positions 
:U5 and about 320). This conclusion is consistent with the 
absence of competition in the binding of these lgds to the 
recA protein (Fig. 3). 

Comparison of MAbl56 with ARM 191 and ARM Mi - 
MAbl56 was isolated by Karu and Allen (Karu and Allen. 
1982). Since it was assumed that the epitope of this antibody 
is located near the C terminus of the recA polypeptide, we 
examined the cross-reaction of MAbloG with the mutant recA 
proteins isolated in the present study. All of the mutations 
affecting the cross-reaction with ARM 191 also affected the 
cross-reaction with MAbl56. but none of the mutations af- 
fecting the cross-reaction with ARM 19H did. This suggests 
that the epitope for MAbl56 and that of ARM 191 are similar, 
but the following results show that they are not identical. 
Most of the mutations abolishing the cross- react ion with 
ARM191 also abolished that with MAblob <Fm 7>, but sub- 
stitution of Leu'""' by Fro in mutant :?8 strongly interfered 
with the cross-reaction with MAblo6, but not so much with 
that with ARM191 (Figs. 6B and 7). The difference in the 
mode of cross-reaction between ARM191 and MA blob* was 
also observed with a mutant recA protein having a substitu- 
tion of Ala™ by Gly (Fig. 7). 

DISCISSION 

Region-specific PGR mutagenesis is an efficient tool for 
introducing random base substitution mutations specifically 

- A. -I. Clark and A. Karu, personal communication. 



in a region defined by a pair of primers. From among the 
plaques expressing the fused recA gene, we obtained mutants 
of the recA gene at the frequency of 0..V7 under the optimized 
conditions. Using this mutagenesis, we identified regions of 
the recA polypeptide in which amino acid substitutions pre- 
vent the cross-reaction with ARM191, ARM193, and/or 
MAbloB. The region for ARM191 and that for ARM193 are 
different, but slightly overlap each other; i e. that for the 
former IgG comprises positions 283 through about 320, and 
that for the latter positions through 315-338. Since ARM191 
and ARM 193 showed no competition, Thr ll \ and Ile :,iy and 
lie" 1 '' and/or VaF ;1 would be located on different sides of a 
local structure or the whole molecule of the recA polypeptide. 

The substitution of an amino acid might interfere with the 
cross-reaction with an IgG directly or through a very local 
change in the tertiary structure, or indirectly through exten- 
sive alteration of the tertiary structure of the recA protein. 
The Leu'"' to Fro substitution in mutant 38 could cause gross 
alteration of the tertiary structure of the recA protein. The 
clustering of other amino acid substitutions in a particular 
region is favorable for the first two possibilities rather than 
the last one. ARM193 was shown to prevent the self-assembly 
of the recA protein and ARM191 to inhibit the binding of 
double-stranded DNA. Thus, the active sites involved in these 
functions would be located in the space at or around the 
relevant epitope. We are testing these possibilities by exam- 
ining the effects of the mutant recA proteins isolated in this 
study and those constructed by another round of region- 
specified PGR mutagenesis. Determination of the epitope loci 
and the inhibitory effects of these IgGs, together with the 
tertiary structure of the recA protein, will facilitate the un- 
derstanding of the function of the protein in relation to its 
structure. 

The technique of region -specified random base substitu- 
tions involving the use of PGR employed in this study is very 
useful not only for epitope mapping, as described in this 
paper, but is also widely useful for studies on the function of 
a gene and an enzyme or protein, because of the flexibility as 
^specifying a target region, and the high yield of the random 
base substitutions. 
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